PREFACE


The book being offered to the reader is a logical continuation of
the author’s three-volume general course of physics. Everything
possible has been done to avoid repeating what has been set out
in the three-volume course. In particular, the experiments underlying
the development of physical ideas are not treated, and some of the
results obtained are not discussed.

In the part devoted to mechanics, contrary to the established
tradition, Lagrange’s equations are derived directly from Newton’s
equations instead of from d’Alembert’s principle. Among the books I
am acquainted with, such a derivation is given in A. S. Kompaneyets’s
book Theoretical Physics (in Russian) for the particular case
of a conservative system. In the present book, I have extended this
method of exposition to systems in which not only conservative,
but also non-conservative forces act.

The treatment of electrodynamics is restricted to a consideration
of media with a permittivity ε and a permeability μ not depending
on the fields E and B.

Sections 40 and 69 devoted to the energy-momentum tensor are
appreciably more complicated. They have been included in the book
because they contain an excellent illustration of how the Lagrangian
formalism is generalized to non-mechanical systems. A reader who
finds these sections too difficult may omit them without any harm
to his understanding of the remaining sections of the book.

I have devoted much attention to the variational principle, with
the consistent use of the following procedure: first the required result
is obtained with the aid of methods which the reader is already
acquainted with, and then the same result is obtained using the
variational principle. The object here was to have the reader come
to treat the variational principle as a quite reliable and powerful
means of research.

A considerable difficulty in studying theoretical physics is that
quite often many mathematical topics have either never been studied
by the reader or have been thoroughly forgotten. To eliminate this
difficulty, I have provided the book with detailed mathematical
appendices. They are sufficiently complete to relieve the reader of
having to turn to mathematical reference books and find the required
information in them. This information is often set out in such books
in a way too complicated for the readers for whom the present book is
intended. Hence, the information on mathematical analysis contained
in a college course of higher mathematics is sufficient for mastering
this book.

The book has been conceived as a training aid for students of
non-theoretical specialities of higher educational institutions. I had
in mind readers who would like to grasp the main ideas and methods
of theoretical physics without delving into the details that are of
interest only to a specialist. This book will be helpful to physics
instructors at higher schools, and also to everyone interested in
the subject but having no time to become acquainted with it (or to
refresh it in memory) from fundamental manuals.

Igor Savelyev 
Part One


MECHANICS 


Chapter I 

THE VARIATIONAL PRINCIPLE 
IN MECHANICS 


1. Introduction 

Depending on the nature of the objects being studied, mechanics
can be divided into particle mechanics, mechanics of a rigid body¹,
and continuum mechanics. The latter, in turn, is divided into
hydrodynamics, gas dynamics, the theory of elasticity, the theory of
plasticity, and so on.

A continuum (continuous medium) is the most difficult object
to study in mechanics because it is a system with an infinitely
great number of degrees of freedom. Besides, in solving a number of
problems of continuum mechanics, the methods and equations of
thermodynamics, electrodynamics, etc. are used in addition to
those of theoretical mechanics. This is why continuum mechanics is
the most complicated branch of mechanics. We shall not deal with
topics of continuum mechanics in this book.

In the general course of physics, problems of mechanics are solved
with the aid of Newton’s equations. In this chapter, we shall acquaint
ourselves with a different approach to describing and studying
the motion of mechanical systems. By a mechanical system, we
shall understand a collection of point particles whose motion may
either be free or restricted by constraints. In particular, a collection
of point particles joined by rigid constraints forms a rigid body. In
the following, for brevity’s sake, we shall call point particles simply
particles.

In accordance with the approach mentioned above, a function of 
generalized coordinates and generalized velocities of a system, and 
also of time, namely, 

L = L (coordinates, velocities, time) 

called a Lagrangian function, or simply a Lagrangian, is associated
with each mechanical system. Generalized coordinates $q_k$ are defined
to be any quantities by means of which the position of a system in
space can be specified. Generalized velocities $\dot{q}_k$ are defined
to be the time derivatives of the generalized coordinates.

¹ What is meant is a perfectly rigid body.

The Lagrangian can be used to characterize not only systems with
a finite number of degrees of freedom, but also systems with an
infinite number of them, namely continuous media and electromagnetic
and other physical fields. Thus, the significance of the Lagrangian
extends beyond the scope of classical mechanics.

Having established the form of the Lagrangian for the mechanical
system being considered, we can describe the motion of the system
with the aid of equations relating the partial derivatives of the
function L with respect to the coordinates and velocities. These
equations, known as Lagrange’s equations, replace Newton’s equations.

The use of Lagrange’s equations instead of Newton’s equations
has the advantage that the number of the former equals the number
of degrees of freedom of the system, which, when constraints
restricting the motion of the system are present, is less than
three times the number of particles in the system. The number of
Newton’s equations needed to describe a system of N particles, on the
other hand, is 3N. In addition, Lagrange’s equations do not include the
reactions of the constraints¹, which are not known beforehand.
Consequently, when using Lagrange’s equations, the reactions of the
constraints are automatically excluded from consideration, and this
noticeably simplifies the solution of the relevant problem. True, the
solution in this case gives information only on the motion of the
system, the values of the reactions remaining unestablished. But in
the majority of physical problems, the values of the reactions are
of no interest, so that the data obtained by the method of Lagrange’s
equations are quite sufficient. As an example, consider the problem
of the oscillations of a simple pendulum (Fig. 1.1). The equation
of Newton’s second law for the particle m has the form

$$m\ddot{\mathbf{r}} = m\mathbf{g} + \mathbf{R}$$

where R is the reaction of the thread. Projecting all the vectors onto
the axes x, y, and z (the z-axis is directed beyond the drawing), we
obtain three scalar equations:

$$m\ddot{x} = R_x, \quad m\ddot{y} = mg + R_y, \quad m\ddot{z} = 0$$

If we characterize the position of the system by the generalized
coordinate φ, one equation will be sufficient instead of three, namely,

$$ml^2\ddot{\varphi} = -mgl\sin\varphi \tag{1.1}$$

It is Lagrange’s equation for the given case¹. It does not contain the
reaction R. Solution of the equation yields φ as a function of t. We
shall not deal for the time being with the form of the function L for
the system being considered (see Example 1 in Sec. 6).

¹ This is true only for ideal constraints, i.e. constraints without friction.
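The single equation (1.1) can be explored numerically. The following is a minimal sketch, not part of the original text: it integrates $\ddot{\varphi} = -(g/l)\sin\varphi$ (Eq. (1.1) divided by $ml^2$) with a semi-implicit Euler step and estimates the period of oscillation. The function name `pendulum_period`, the step `dt`, and the default values of `l` and `g` are illustrative assumptions.

```python
import math


def pendulum_period(phi0, l=1.0, g=9.81, dt=1e-4):
    """Estimate the period of a pendulum released at rest from the angle phi0 > 0.

    Integrates phi'' = -(g / l) * sin(phi), i.e. Eq. (1.1) divided by m * l**2,
    with a semi-implicit Euler step (an assumed, illustrative integrator).
    """
    phi, omega, t = phi0, 0.0, 0.0
    while True:
        omega += -(g / l) * math.sin(phi) * dt  # update the angular velocity first
        phi += omega * dt                       # then the angle, using the new velocity
        t += dt
        if omega >= 0.0:
            # The velocity changes sign at the opposite turning point,
            # which is reached after half a period.
            return 2.0 * t


small = pendulum_period(0.01)
print(small, 2.0 * math.pi * math.sqrt(1.0 / 9.81))
```

For a small amplitude the estimate should be close to the familiar value $2\pi\sqrt{l/g}$; for larger amplitudes the period comes out longer. Note that neither the mass m nor the reaction R enters the calculation.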

Hence, the motion of a mechanical system can be described with the
aid of either Newton’s or Lagrange’s equations. Naturally, the latter
can be arrived at proceeding from Newton’s equations (this will be
done in Sec. 4). A very significant circumstance, however, is that
Lagrange’s equations can be obtained with the aid of a quite general
variational principle, the principle of least action. It can be used as
the foundation of classical mechanics instead of Newton’s laws. A merit
of the principle of least action is that it can readily be extended to
systems that are not mechanical or not purely mechanical, for example,
elastic media, electromagnetic fields, and fields of elementary
particles.

Fig. 1.1.

Summarizing, the approach to the study of mechanical systems
set out in the present chapter can be said to be much more general
than the method based on Newton’s laws.



2. Constraints 

For a system of particles with the masses $m^{(1)}, m^{(2)}, \ldots$, the
equations of Newton’s second law can be written as follows:

$$m_i \ddot{x}_i = F_i \quad (i = 1, 2, \ldots, n) \tag{2.1}$$

where $m_1 = m_2 = m_3 = m^{(1)}$ is the mass of the 1st particle,
$m_4 = m_5 = m_6 = m^{(2)}$ is that of the 2nd particle, ..., $x_1$ is the
coordinate x of the 1st particle, $x_2$ is the coordinate y of the 1st
particle, $x_3$ is the coordinate z of the 1st particle, $x_4, x_5, x_6$ are
the Cartesian coordinates x, y, z of the 2nd particle, ..., $F_1$ is the
projection onto the x-axis of the resultant force $\mathbf{F}^{(1)}$ acting
on the 1st particle, $F_2$ is the projection of the force $\mathbf{F}^{(1)}$
onto the y-axis, $F_3$ is the projection of the same force onto the
z-axis, $F_4, F_5, F_6$ are the x-, y-, z-components of the force
$\mathbf{F}^{(2)}$ acting on the second particle, etc. The number n of
equations contained in (2.1) equals three times the number of particles
in the system.

¹ The generalized velocity enters Eq. (1.1) in the form $\frac{d}{dt}\dot{\varphi}$.
We remind our reader that dots over a symbol stand for the time
derivative: $\dot{x} = dx/dt$ and $\ddot{x} = d^2x/dt^2$.

Restrictions of a geometric or kinematic nature may be imposed on
the positions and velocities of a system’s particles. These restrictions
are called constraints.

Examples of systems with geometric constraints are:

(1) a particle which in its motion cannot leave a given surface or
a given curve. The surface or curve may be stationary (a stationary
constraint) or move in a preset way (a non-stationary constraint);

(2) two particles A and B joined by a rigid weightless rod of
length l. In this case, the restriction imposed by the constraint can
be written in the form of the equation

$$(x_A - x_B)^2 + (y_A - y_B)^2 + (z_A - z_B)^2 = l^2 \tag{2.2}$$

(3) two particles joined by a weightless thread of length l. The
analytical expression of such a constraint is

$$(x_A - x_B)^2 + (y_A - y_B)^2 + (z_A - z_B)^2 \le l^2 \tag{2.3}$$

(4) a perfectly rigid body which can be considered as a system of
particles with unchanging distances between them, i.e. experiencing
constraints like (2.2).

An example of a system with a constraint of both a geometric and 
a kinematic nature is a ball rolling without slipping over a rough 
surface. The kinematic restriction is that the velocity of the point 
of contact must be zero. 

In the general case, a geometric constraint can be represented by
the equation

$$f(x_1, x_2, \ldots, x_n, t) = 0 \tag{2.4}$$

(n = 3N, where N is the number of particles in the system).
When restrictions are imposed on the velocities of particles in
addition to their coordinates, the equation of a constraint is

$$\varphi(x_1, \ldots, x_n, \dot{x}_1, \ldots, \dot{x}_n, t) = 0 \tag{2.5}$$

If Eq. (2.5) can be integrated over time, it is evidently equivalent
to an equation in the form of (2.4).

Constraints of the kind given by (2.4) and integrable constraints
given by (2.5) that can be reduced to them are known as holonomic
(or integrable) constraints. Systems with such constraints are also
called holonomic. The systems in Examples 1, 2, and 4 treated above
belong to the holonomic type.

Hence, with holonomic constraints, the restrictions they impose
are expressed in the form of equations relating the coordinates of
the particles and time [see Eq. (2.4)].

Non-integrable constraints in the form of (2.5), and also constraints
expressed as inequalities (see Example 3), are called non-holonomic.

Constraints that do not change with time are known as stationary
(or scleronomous). Constraints that change with time are known as
non-stationary (or rheonomous).

The equation of a holonomic stationary constraint is

$$f(x_1, x_2, \ldots, x_n) = 0 \tag{2.6}$$

It differs from Eq. (2.4) for a holonomic non-stationary constraint
in that it does not include the time t explicitly.

There are no general methods of solution for problems with
non-holonomic constraints. An individual approach to each problem is
required. We shall not consider non-holonomic systems.

Every holonomic constraint, i.e. every constraint expressed by 
Eq. (2.4) or (2.6), allows us to represent one of the coordinates as 
a function of the others. Consequently, every such constraint 
diminishes the number of independent coordinates by one. 

Recall that the number of independent quantities needed to 
determine the position of a system in space is called the number of 
degrees of freedom of the system. We can therefore say that every 
holonomic constraint diminishes the number of degrees of freedom 
of the system by one. 

If constraints are absent, a system of N particles has n = 3N
degrees of freedom. When there are r constraints, the number of
degrees of freedom will be s = n − r = 3N − r.
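The counting rule s = 3N − r is easy to apply mechanically. The sketch below is an illustration only; the function name `degrees_of_freedom` and its argument names are hypothetical.

```python
def degrees_of_freedom(num_particles, num_holonomic_constraints):
    """Return s = 3N - r for N particles subject to r independent holonomic constraints."""
    n = 3 * num_particles                  # number of Cartesian coordinates
    s = n - num_holonomic_constraints      # each constraint removes one coordinate
    if s < 0:
        raise ValueError("more constraints than coordinates")
    return s


# Two particles joined by a rigid rod, one constraint of the form (2.2).
print(degrees_of_freedom(2, 1))  # 5
# A plane pendulum: one particle, constrained to a plane and to a fixed thread length.
print(degrees_of_freedom(1, 2))  # 1
# Three particles joined pairwise by three rigid rods.
print(degrees_of_freedom(3, 3))  # 6
```

The last case gives six, the number of degrees of freedom of a free rigid body.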

Constraints act on the particles of a system with the forces
$\mathbf{R}^{(\alpha)}$ called reactions. A constraint with no friction is
said to be ideal. If an ideal constraint is also stationary, its reaction
R is always perpendicular to the direction of the possible elementary
displacement of the particle to which the force R is applied (for
instance, in Fig. 1.1 the reaction R is directed along the thread,
while the velocity of the particle is perpendicular to it). This is why
reactions of ideal stationary constraints do no work on a system. If a
constraint (even an ideal one) depends on the time, the work done by
its reaction will be non-zero, as a rule.

The expression for the elementary work done on the particles of
a system by the reactions of the constraints is $dA = \sum_i R_i\,dx_i$.
We have seen that this work is zero for a system with ideal stationary
constraints. Consequently, for such systems

$$\sum_i R_i\,dx_i = 0 \tag{2.7}$$

where $dx_i$ are the projections onto the coordinate axes of the
displacements of the particles allowed by the constraints imposed on
them¹.


3. Equations of Motion in Cartesian Coordinates 


Consider a system formed by interacting particles. Assume that
external forces also act on the particles. We shall use the same
symbols as we did in the preceding section (see the first paragraph of
Sec. 2) for the coordinates of the particles and the components of
the forces.

In some cases, the forces acting on particles (or at least a part of
these forces) can be represented in the form

$$F_i = -\frac{\partial U}{\partial x_i} \tag{3.1}$$

where

$$U = U(x_1, x_2, \ldots, x_n, t) \tag{3.2}$$

is a function of the coordinates of the particles and the time known
as the potential of the system. If the function U does not include
the time t explicitly, it is the potential energy of the system.

A force determined by formula (3.1) is called a potential force.
Stationary and non-stationary potential forces should be distinguished.
Forces depending only on the coordinates of the particles and not
depending on the time explicitly are called stationary. A function U
not containing the time explicitly corresponds to these forces.
When U contains t explicitly, the force depends not only on the
coordinates, but also on the time and, consequently, will be
non-stationary. Stationary potential forces are called conservative.
Systems in which only conservative forces act are also called
conservative.


We shall note that in accordance with (3.1), the force acting on the
particle numbered α can be represented as a gradient of the
function (3.2):

$$\mathbf{F}_\alpha = -\nabla_\alpha U(x_i, t) \tag{3.3}$$

Here $\nabla_\alpha$ is an operator whose components equal the partial
derivatives with respect to the coordinates of the α-th particle. By
$x_i$ here and below in the symbols of functions we denote the set of all
the coordinates: $x_1, x_2, \ldots, x_n$.
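Relation (3.1) can be checked numerically for any smooth potential. The following sketch is an illustration under stated assumptions: the helper `force_from_potential`, the central-difference step `h`, and the sample potential $U = \frac{1}{2}k(x_1^2 + x_2^2 + x_3^2)$ are not taken from the text.

```python
def force_from_potential(U, x, h=1e-6):
    """Approximate F_i = -dU/dx_i, Eq. (3.1), by central differences.

    U is a function of a list of coordinates; x is the point of evaluation.
    """
    forces = []
    for i in range(len(x)):
        forward = list(x)
        backward = list(x)
        forward[i] += h
        backward[i] -= h
        forces.append(-(U(forward) - U(backward)) / (2.0 * h))
    return forces


K = 2.0  # assumed stiffness of the sample potential


def elastic_potential(x):
    """Sample potential U = K * (x1^2 + x2^2 + x3^2) / 2 of one particle."""
    return 0.5 * K * sum(c * c for c in x)


print(force_from_potential(elastic_potential, [1.0, -2.0, 0.5]))
```

For this sample potential the result should reproduce the restoring force $F_i = -k x_i$ to within rounding.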


¹ In the sums encountered in theoretical physics, the index over which
summation is performed (the dummy index) is repeated twice, as a rule.
In this connection, it is customary practice to omit the symbol $\sum$
and understand summation over the twice repeated indices. For instance,
simply $a_i b_i$ is written instead of $\sum_i a_i b_i$. Although this way of
writing is distinguished for its brevity, we shall not use it. Summation
will be understood only where the symbol $\sum$ is written.
Formula (3.3) can also be written as follows:

$$\mathbf{F}_\alpha = -\frac{\partial U}{\partial \mathbf{r}_\alpha} \tag{3.4}$$

where $\mathbf{r}_\alpha$ is the position vector of the particle¹.

Assume that part of the forces exerted on the particles of a system
are potential and part of them are non-potential. Equation (2.1)
can therefore be written as

$$m_i \ddot{x}_i = -\frac{\partial U}{\partial x_i} + F_i^* \quad (i = 1, 2, \ldots, n) \tag{3.5}$$

where $F_i^*$ are the components of the non-potential forces, and n is
three times the number of particles.

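Equation (3.5) can be written in a form very convenient for
generalizations. For this purpose, the Lagrangian mentioned in Sec. 1 is
introduced. It is determined as follows for the system of particles
we are considering:

$$L(x_i, \dot{x}_i, t) = \sum_i \frac{m_i \dot{x}_i^2}{2} - U(x_i, t) \tag{3.6}$$

Time differentiation of the partial derivative of L with respect to
$\dot{x}_i$ yields the left-hand side of Eq. (3.5):

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i} = \frac{d}{dt}(m_i \dot{x}_i) = m_i \ddot{x}_i$$

The partial derivative of L with respect to $x_i$ gives the i-th
component of the potential force:

$$\frac{\partial L}{\partial x_i} = -\frac{\partial U}{\partial x_i}$$

Consequently, we arrive at the relations

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i} - \frac{\partial L}{\partial x_i} = F_i^* \quad (i = 1, 2, \ldots, n) \tag{3.7}$$

named Lagrange’s equations.

For systems in which only potential forces act, Lagrange’s equations
are

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i} - \frac{\partial L}{\partial x_i} = 0 \quad (i = 1, 2, \ldots, n) \tag{3.8}$$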
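That Eq. (3.8) singles out the true motion can be seen on a one-dimensional example. The sketch below is an illustration only: it assumes a harmonic oscillator with $L = \frac{1}{2}m\dot{x}^2 - \frac{1}{2}kx^2$, evaluates the left-hand side of (3.8) by finite differences along a trial trajectory, and shows that it vanishes only when the trajectory has the frequency $\sqrt{k/m}$. All names and numerical values are hypothetical.

```python
import math

M, K = 2.0, 8.0              # assumed mass and stiffness
OMEGA = math.sqrt(K / M)     # natural frequency of the oscillator


def lagrangian(x, v):
    """L = M v^2 / 2 - K x^2 / 2, a special case of Eq. (3.6)."""
    return 0.5 * M * v * v - 0.5 * K * x * x


def partial(f, args, index, h=1e-5):
    """Central-difference partial derivative of f with respect to args[index]."""
    up, down = list(args), list(args)
    up[index] += h
    down[index] -= h
    return (f(*up) - f(*down)) / (2.0 * h)


def lagrange_residual(x_of_t, v_of_t, t, h=1e-4):
    """Left-hand side of Eq. (3.8), d/dt(dL/dv) - dL/dx, along a trial trajectory."""
    def momentum(time):
        return partial(lagrangian, (x_of_t(time), v_of_t(time)), 1)

    dp_dt = (momentum(t + h) - momentum(t - h)) / (2.0 * h)
    dL_dx = partial(lagrangian, (x_of_t(t), v_of_t(t)), 0)
    return dp_dt - dL_dx


# True motion x = cos(OMEGA t): the residual is zero up to numerical error.
print(lagrange_residual(lambda t: math.cos(OMEGA * t),
                        lambda t: -OMEGA * math.sin(OMEGA * t), 0.3))
# Trial motion with a wrong frequency: the residual is clearly non-zero.
print(lagrange_residual(lambda t: math.cos(3.0 * t),
                        lambda t: -3.0 * math.sin(3.0 * t), 0.3))
```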


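¹ By the derivative of the scalar φ with respect to the vector a is understood
a vector having the components $\partial\varphi/\partial a_x$, $\partial\varphi/\partial a_y$, $\partial\varphi/\partial a_z$.
Consequently, the symbol $\partial\varphi/\partial\mathbf{r}$ stands for a vector with the
components $\partial\varphi/\partial x$, $\partial\varphi/\partial y$, $\partial\varphi/\partial z$ (i.e. grad φ
or $\nabla\varphi$; see Appendix XI).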
Lagrange’s equations can sometimes be written in the form of (3.8)
even when the forces acting on particles depend on the velocities
(the Lorentz force is an example of such forces). This can be done
provided that such forces can be obtained from a certain function
$U^*(x_i, \dot{x}_i, t)$ with the aid of the relation

$$F_i = -\frac{\partial U^*}{\partial x_i} + \frac{d}{dt}\frac{\partial U^*}{\partial \dot{x}_i} \tag{3.9}$$

(we leave it to the reader to verify this statement).

The function $U^*(x_i, \dot{x}_i, t)$ is known as the generalized potential.
We shall apply the adjective generalized-potential to forces
corresponding to this potential. When these forces are present, the
Lagrangian is written as follows:

$$L(x_i, \dot{x}_i, t) = \sum_i \frac{m_i \dot{x}_i^2}{2} - U(x_i, t) - U^*(x_i, \dot{x}_i, t) \tag{3.10}$$

‘■V- f : ' • i 

The forces depending on the velocities of particles also include
dissipative forces. This adjective is used to designate forces always
directed oppositely to the velocities of particles and, consequently,
causing their deceleration. Dissipative forces include, for example,
forces of friction. When dissipative forces are present, the total
mechanical energy of a system diminishes (dissipates), transforming
into other non-mechanical kinds of energy (for instance,
into the internal energy of bodies or, as we customarily say, into
heat).

The dissipative forces $F^{(d)}$ are often proportional to the velocities
of particles so that their components along the coordinate axes are
determined by the equation

$$F_i^{(d)} = -k_i \dot{x}_i \quad (i = 1, 2, \ldots, n) \tag{3.11}$$

In this case, the dissipative forces can be expressed in terms of Rayleigh’s
dissipative function¹ equal to

$$D = \frac{1}{2}\sum_i k_i \dot{x}_i^2 \tag{3.12}$$

Indeed, comparing expressions (3.11) and (3.12), we can easily see
that

$$F_i^{(d)} = -\frac{\partial D}{\partial \dot{x}_i} \tag{3.13}$$

Substituting this expression into formula (3.7) for $F_i^*$ and assuming
that there are no other non-potential forces, we obtain the equation

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i} - \frac{\partial L}{\partial x_i} + \frac{\partial D}{\partial \dot{x}_i} = 0 \tag{3.14}$$

¹ This function is customarily designated by the letter F or R. To avoid
confusion, however, we have preferred the symbol D.
The function D has a simple physical meaning. The work done by
dissipative forces during the time dt is

$$dA = \sum_i F_i^{(d)}\,dx_i = \sum_i F_i^{(d)} \dot{x}_i\,dt = -\sum_i \frac{\partial D}{\partial \dot{x}_i}\dot{x}_i\,dt = -2D\,dt$$

(we have used Euler’s theorem on homogeneous functions, see
Appendix II). This work is done at the expense of the system’s store of
energy. Hence, the quantity −dA/dt = 2D gives the rate of energy
dissipation.
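The relation between the dissipative forces (3.11), the function D of (3.12), and the dissipated power can be verified directly. The sketch below is illustrative; the function names and the sample coefficients and velocities are assumptions.

```python
def rayleigh_function(k, v):
    """Rayleigh's dissipative function D = (1/2) * sum_i k_i * v_i^2, Eq. (3.12)."""
    return 0.5 * sum(ki * vi * vi for ki, vi in zip(k, v))


def dissipative_forces(k, v):
    """Components F_i = -k_i * v_i of the dissipative forces, Eq. (3.11)."""
    return [-ki * vi for ki, vi in zip(k, v)]


def dissipative_power(k, v):
    """Power dA/dt = sum_i F_i * v_i developed by the dissipative forces."""
    return sum(f * vi for f, vi in zip(dissipative_forces(k, v), v))


k = [0.5, 1.0, 2.0]      # assumed friction coefficients
v = [3.0, -1.0, 0.25]    # assumed velocity components
print(dissipative_power(k, v), -2.0 * rayleigh_function(k, v))
```

Both printed numbers coincide, in agreement with dA/dt = −2D.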

Lagrange’s equations are superior to Newton’s equations (3.5) in 
that, as will be shown in the following section, they retain their form 
when we transform from Cartesian to any generalized coordinates. 
When we pass over to independent generalized coordinates, the 
reactions of the ideal holonomic constraints vanish from the equa- 
tions, which greatly facilitates the solution of problems. 


4. Lagrange's Equations in Generalized Coordinates 

Generalized coordinates $q_k$ are defined to be any quantities (lengths,
angles, areas, etc.) that determine the position of a mechanical
system in space. As an example, we can indicate the spherical
coordinates of a particle: $r, \vartheta, \varphi$. Cartesian coordinates are
obviously a particular case of generalized coordinates.

The time derivatives of the generalized coordinates, i.e. the
quantities $\dot{q}_k$, are known as the generalized velocities of a system.

The number of independent generalized coordinates needed to set 
the position of a system equals the number of its degrees of freedom. 
In the following, we shall always choose the generalized coordinates 
so that their number coincides with that of the degrees of freedom 
of a system (i.e. so that they are all independent of one another). 

We must note that generalized coordinates are often helpful in
systems without constraints too (in this case their number coincides
with that of the Cartesian coordinates). For instance, in solving
a problem involving the motion of a particle in a central field of
forces, the spherical coordinates $r, \vartheta, \varphi$ are more convenient than
the Cartesian coordinates x, y, z.

The following representation is very helpful. Let us introduce into
consideration a system of coordinates in an imaginary s-dimensional
space (it is called a configuration space or a q-space). We plot the
values of the coordinates $q_k(t)$ along the axes of this system. Hence,
for each instant t, a point in the configuration space corresponds to
the position of the system in conventional space. The motion of
a point in our imaginary s-dimensional space corresponds to the
motion of the system in the real three-dimensional space.
We shall prove that Eqs. (3.7) remain true in a transition from the
Cartesian coordinates $x_i$ to the generalized coordinates $q_k$, and also
that Lagrange’s equations written in independent generalized
coordinates contain no constraint reactions.

Consider a holonomic system with ideal stationary constraints.
We shall divide the non-potential forces acting on the system’s
particles into two categories: the constraint reactions $R_i$ and the other
non-potential forces $F_i^*$. Equations (3.7) will therefore become

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i} - \frac{\partial L}{\partial x_i} = R_i + F_i^* \quad (i = 1, 2, \ldots, n) \tag{4.1}$$

Let r constraints expressed by the conditions

$$f_l(x_1, x_2, \ldots, x_n) = 0 \quad (l = 1, 2, \ldots, r) \tag{4.2}$$

be imposed on the system (since the constraints are stationary, the
functions $f_l$ do not contain the time explicitly).

The Cartesian coordinates $x_i$ determining the position of a
system’s particles can be represented as functions of the generalized
coordinates $q_k$. If the equations of the constraints do not contain t
explicitly, we can always choose $q_k$ so that the functions expressing $x_i$
in terms of $q_k$ do not contain t, i.e. so that these functions are in the
form

$$x_i = x_i(q_1, q_2, \ldots, q_s) \quad (i = 1, 2, \ldots, n) \tag{4.3}$$

(s is the number of degrees of freedom, equal to n − r). In the following,
for brevity’s sake, we shall write expressions of the kind
$x_i(q_1, q_2, \ldots, q_s)$ in the form $x_i(q_k)$.

In accordance with (4.3), the time derivatives of the functions
$x_i(q_k)$ have the form

$$\dot{x}_i = \sum_k \frac{\partial x_i}{\partial q_k}\dot{q}_k \tag{4.4}$$

Summation is performed over all the $q_k$’s, i.e. the subscript k takes
on all the values from 1 to s. Expression (4.4) can be written for
any i from 1 to n. In the following, when this will not lead to
misunderstandings, we shall not indicate the values taken on by the
subscript over which summation is being performed. This subscript is
called a dummy index. We must note that in summation formulas
a dummy index may be designated by any letter: the use of one
subscript instead of another does not change the sum. In particular,
expression (4.4) can be written just as successfully, for instance, in
the form

$$\dot{x}_i = \sum_l \frac{\partial x_i}{\partial q_l}\dot{q}_l \tag{4.5}$$
The quantities $\partial x_i/\partial q_k$ do not contain the generalized velocities $\dot{q}_k$.
We can therefore state on the basis of (4.4) that

$$\frac{\partial \dot{x}_i}{\partial \dot{q}_k} = \frac{\partial x_i}{\partial q_k} \tag{4.6}$$

We must also note that since functions (4.3) do not contain the
quantities $\dot{q}_k$, the partial derivatives of $x_i$ with respect to these
quantities are zero:

$$\frac{\partial x_i}{\partial \dot{q}_k} = 0 \tag{4.7}$$

Finally we shall obtain another relation that we shall need later
on. The quantities $\partial x_i/\partial q_k$ are functions only of the coordinates $q_k$
(we have already noted that these quantities do not contain the
velocities $\dot{q}_k$). Hence,

$$\frac{d}{dt}\frac{\partial x_i}{\partial q_k} = \sum_l \frac{\partial}{\partial q_l}\left(\frac{\partial x_i}{\partial q_k}\right)\dot{q}_l = \sum_l \frac{\partial^2 x_i}{\partial q_l\,\partial q_k}\dot{q}_l \tag{4.8}$$

Now, we shall differentiate expression (4.5) with respect to $q_k$. Since
the derivatives $\partial \dot{q}_l/\partial q_k$ equal zero, we obtain

$$\frac{\partial \dot{x}_i}{\partial q_k} = \sum_l \frac{\partial}{\partial q_k}\left(\frac{\partial x_i}{\partial q_l}\right)\dot{q}_l = \sum_l \frac{\partial^2 x_i}{\partial q_k\,\partial q_l}\dot{q}_l$$

Comparing this expression with (4.8), we arrive at the relation

$$\frac{\partial \dot{x}_i}{\partial q_k} = \frac{d}{dt}\frac{\partial x_i}{\partial q_k} \tag{4.9}$$


Having obtained all the required relations, let us now turn to
the proof. Multiplication of both sides of each of Eqs. (4.1) by
$\partial x_i/\partial q_k$ and summation of all the equations yield

$$\sum_i \left(\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i}\right)\frac{\partial x_i}{\partial q_k} - \sum_i \frac{\partial L}{\partial x_i}\frac{\partial x_i}{\partial q_k} = \sum_i R_i \frac{\partial x_i}{\partial q_k} + \sum_i F_i^* \frac{\partial x_i}{\partial q_k} \tag{4.10}$$

The first of the sums on the right-hand side of this expression equals
zero. To prove this, we multiply it by $dq_k$:

$$\left(\sum_i R_i \frac{\partial x_i}{\partial q_k}\right) dq_k = \sum_i R_i \left(\frac{\partial x_i}{\partial q_k}\,dq_k\right) = \sum_i R_i\,dx_i$$

Here $dx_i$ are the increments of the Cartesian coordinates appearing
when $q_k$ receives the increment $dq_k$, the other generalized coordinates
remaining unchanged. According to (2.7), however, the sum $\sum_i R_i\,dx_i$
for any $dx_i$’s allowed by the constraints is zero (we remind our
reader that we assume the constraints to be stationary and ideal).
Hence,

$$\left(\sum_i R_i \frac{\partial x_i}{\partial q_k}\right) dq_k = 0$$

and, since $dq_k \neq 0$, we arrive at the conclusion that

$$\sum_i R_i \frac{\partial x_i}{\partial q_k} = 0 \tag{4.11}$$

The quantity

$$Q_k^* = \sum_i F_i^* \frac{\partial x_i}{\partial q_k} \tag{4.12}$$

[see the second term on the right-hand side of Eq. (4.10)] is referred
to as the generalized force. This name is based on the grounds that
the expression

$$Q_k^*\,dq_k = \sum_i F_i^* \left(\frac{\partial x_i}{\partial q_k}\,dq_k\right) = \sum_i F_i^*\,dx_i$$

gives the work done by the forces $F_i^*$ in the displacements of
particles due to an increase in the generalized coordinate $q_k$ by $dq_k$.
Consequently, the right-hand side of Eq. (4.10) is simply $Q_k^*$.
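Definition (4.12) can be evaluated numerically for a given transformation (4.3). The sketch below is an illustration under assumptions not found in the text: a single particle moving in a plane, described by the polar coordinates $q = (r, \varphi)$ with $x = r\cos\varphi$, $y = r\sin\varphi$; the helper names and the finite-difference step are hypothetical.

```python
import math


def cartesian(q):
    """Transformation (4.3) for plane polar coordinates q = (r, phi)."""
    r, phi = q
    return [r * math.cos(phi), r * math.sin(phi)]


def generalized_forces(F, q, x_of_q=cartesian, h=1e-6):
    """Q_k = sum_i F_i * dx_i/dq_k, Eq. (4.12), with central-difference derivatives."""
    Q = []
    for k in range(len(q)):
        forward, backward = list(q), list(q)
        forward[k] += h
        backward[k] -= h
        x_plus, x_minus = x_of_q(forward), x_of_q(backward)
        Q.append(sum(Fi * (a - b) / (2.0 * h)
                     for Fi, a, b in zip(F, x_plus, x_minus)))
    return Q


q = [2.0, math.pi / 6.0]   # assumed position: r = 2, phi = 30 degrees
F = [1.0, 3.0]             # assumed Cartesian force components (F_x, F_y)
print(generalized_forces(F, q))
```

For this choice of coordinates, the generalized force conjugate to r is the radial projection of the force, and the one conjugate to φ is the moment of the force about the origin, $xF_y - yF_x$. This shows that a generalized force need not have the dimension of a force.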
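Let us now consider the left-hand side of this equation. We add
the sum $\sum_i \frac{\partial L}{\partial \dot{x}_i}\frac{\partial \dot{x}_i}{\partial q_k}$ to the minuend and the subtrahend of the
left-hand side. Hence, the left-hand side of (4.10) will contain the
difference of the expressions

$$\sum_i \left(\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i}\right)\frac{\partial x_i}{\partial q_k} + \sum_i \frac{\partial L}{\partial \dot{x}_i}\frac{\partial \dot{x}_i}{\partial q_k} \tag{4.13}$$

and

$$\sum_i \frac{\partial L}{\partial x_i}\frac{\partial x_i}{\partial q_k} + \sum_i \frac{\partial L}{\partial \dot{x}_i}\frac{\partial \dot{x}_i}{\partial q_k} \tag{4.14}$$

Expression (4.14) is the derivative of the Lagrangian with respect
to the generalized coordinate $q_k$, i.e. $\partial L/\partial q_k$.

With a view to (4.9), we can write expression (4.13) in the form

$$\sum_i \left(\frac{d}{dt}\frac{\partial L}{\partial \dot{x}_i}\right)\frac{\partial x_i}{\partial q_k} + \sum_i \frac{\partial L}{\partial \dot{x}_i}\left(\frac{d}{dt}\frac{\partial x_i}{\partial q_k}\right) = \frac{d}{dt}\sum_i \frac{\partial L}{\partial \dot{x}_i}\frac{\partial x_i}{\partial q_k}$$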
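Finally, substituting $\partial \dot{x}_i/\partial \dot{q}_k$ for $\partial x_i/\partial q_k$ in accordance with (4.6),
we arrive at the expression

$$\frac{d}{dt}\sum_i \frac{\partial L}{\partial \dot{x}_i}\frac{\partial \dot{x}_i}{\partial \dot{q}_k}$$

which is exactly $\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k}$. Indeed, according to the rules for the
differentiation of a composite function

$$\frac{\partial L}{\partial \dot{q}_k} = \sum_i \frac{\partial L}{\partial x_i}\frac{\partial x_i}{\partial \dot{q}_k} + \sum_i \frac{\partial L}{\partial \dot{x}_i}\frac{\partial \dot{x}_i}{\partial \dot{q}_k}$$

but according to (4.7), all the $\partial x_i/\partial \dot{q}_k$’s equal zero so that the
first of the sums vanishes. We have thus shown that expression
(4.13) equals $\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k}$.

Therefore, the difference between expressions (4.13) and (4.14)
is simply $\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k} - \frac{\partial L}{\partial q_k}$. Introducing this value into (4.10), we
arrive at the equations

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k} - \frac{\partial L}{\partial q_k} = Q_k^* \quad (k = 1, 2, \ldots, s) \tag{4.15}$$

that differ from Eqs. (4.1) in containing the generalized
coordinates $q_k$ instead of the Cartesian coordinates $x_i$ and the generalized
forces $Q_k^*$ instead of the forces $F_i^*$. Equations (4.15) do not contain
the reactions $R_i$ of the constraints.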

If all the forces acting on the particles of a system (except for the
reactions of the constraints) are potential ones, Eqs. (4.15) become

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k} - \frac{\partial L}{\partial q_k} = 0 \quad (k = 1, 2, \ldots, s) \tag{4.16}$$

Equations (4.15) and (4.16) are Lagrange’s equations in generalized
coordinates. The function $L = L(q_1, q_2, \ldots, q_s, \dot{q}_1, \dot{q}_2, \ldots, \dot{q}_s)$
in them is a function obtained from (3.6) by substituting functions
(4.3) and (4.4) for the quantities $x_i$ and $\dot{x}_i$.

It can be shown (see Appendix I) that Eqs. (4.15) and (4.16) also 
remain true for holonomic systems with ideal non-stationary con- 
straints. 

We have thus proved that Lagrange’s equations have the same 
form in both Cartesian and generalized coordinates. 

Expression (3.6) for the Lagrangian in Cartesian coordinates shows
that the derivative of L with respect to $\dot{x}_i$ equals $p_i$, the projection
of the momentum of the relevant particle onto one of the coordinate
axes:

$$\frac{\partial L}{\partial \dot{x}_i} = m_i \dot{x}_i = p_i \tag{4.17}$$

and the derivative of L with respect to $x_i$ equals $F_i$, the projection
of the potential force acting on the particle:

$$\frac{\partial L}{\partial x_i} = -\frac{\partial U}{\partial x_i} = F_i \tag{4.18}$$

By analogy with (4.17) and (4.18), the quantity

$$p_k = \frac{\partial L}{\partial \dot{q}_k} \tag{4.19}$$

is called the generalized momentum, and the quantity

$$Q_k = \frac{\partial L}{\partial q_k} \tag{4.20}$$

the generalized force.

With the use of these quantities, Eqs. (4.16) can be written as

$$\frac{dp_k}{dt} = Q_k \tag{4.21}$$

similar to Newton’s equations

$$\frac{dp_i}{dt} = F_i$$

We must note that definition (4.20) is more general than definition
(4.12), which we have introduced for non-potential forces. If we
were to extend definition (4.12) to potential forces, i.e. to forces that
can be written in the form $F_i = -\partial U/\partial x_i$, we would
arrive at the expression

$$Q_k = \sum_i F_i \frac{\partial x_i}{\partial q_k} = -\sum_i \frac{\partial U}{\partial x_i}\frac{\partial x_i}{\partial q_k} = -\frac{\partial U}{\partial q_k} \tag{4.22}$$

According to definition (4.20), however,

$$Q_k = \frac{\partial L}{\partial q_k} = \frac{\partial (T - U)}{\partial q_k} = \frac{\partial T}{\partial q_k} - \frac{\partial U}{\partial q_k} \tag{4.23}$$

Expression (4.23) differs from (4.22) in the term $\partial T/\partial q_k$
which, as will be shown in the following section, generally speaking, is
non-zero [see formula (5.9)].
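The difference between (4.22) and (4.23) can be seen in a short worked
example. The sketch below assumes a single particle of mass $m$ moving
in a plane in a potential $U(r)$ and described by the polar coordinates
$r$ and $\varphi$, with the kinetic energy taken in the form of formula
(6.1) of Sec. 6. The choice of system is only an illustration and is not
part of the derivation above.

```latex
\begin{aligned}
L &= \tfrac{1}{2} m \left(\dot r^2 + r^2 \dot\varphi^2\right) - U(r), \\
\text{by (4.22):}\quad Q_r &= -\frac{\partial U}{\partial r}, \\
\text{by (4.20) and (4.23):}\quad Q_r &= \frac{\partial L}{\partial r}
   = m r \dot\varphi^2 - \frac{\partial U}{\partial r}.
\end{aligned}
```

The two results differ by $\partial T/\partial r = m r \dot\varphi^2$,
which appears because in polar coordinates the kinetic energy depends on
the coordinate $r$ and not only on the velocities.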


5. The Lagrangian and Energy 


The Lagrangian $L(q_k, \dot q_k, t)$ is a characteristic function of a
mechanical system. It is quite natural that not only momenta and forces
[see formulas (4.19) and (4.20)], but also the energy of a system can
be expressed in terms of this function.

The expression for the total energy of a system in terms of the
Lagrangian is

$$E = \sum_k \frac{\partial L}{\partial \dot q_k}\,\dot q_k - L(q_k, \dot q_k, t) \tag{5.1}$$


The grounds for determining E in exactly this way will be revealed 
below. Expression (5.1) is more general than the one known from 
the general course of physics, i.e. 

$$E = T + U \tag{5.2}$$


(Here T is the kinetic energy, and U, the potential energy.) Defini- 
tion (5.1) also holds when the total energy cannot be represented as 
the sum of the kinetic and potential energies. 

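Let us calculate the total time derivative of the quantity (5.1):

$$\frac{dE}{dt} = \sum_k \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_k}\right)\dot q_k + \sum_k \frac{\partial L}{\partial \dot q_k}\,\ddot q_k - \sum_k \frac{\partial L}{\partial q_k}\,\dot q_k - \sum_k \frac{\partial L}{\partial \dot q_k}\,\ddot q_k - \frac{\partial L}{\partial t} = \sum_k \left(\frac{d}{dt}\frac{\partial L}{\partial \dot q_k} - \frac{\partial L}{\partial q_k}\right)\dot q_k - \frac{\partial L}{\partial t}$$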


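In accordance with Lagrange’s equation (4.15), the expression in
parentheses equals the non-potential generalized force $Q_k^*$. Hence,

$$\frac{dE}{dt} = \sum_k Q_k^*\,\dot q_k - \frac{\partial L}{\partial t} = W^* - \frac{\partial L}{\partial t} \tag{5.3}$$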

where W* is the power of all the non-potential forces acting on the 
system’s particles. 

When the Lagrangian does not contain the time explicitly,
$\partial L/\partial t = 0$, and formula (5.3) becomes

$$\frac{dE}{dt} = W^*$$

which is known from the general course of physics. 

A glance at (5.3) shows that for conservation of the total energy 
of a system, the absence of non-potential forces is not sufficient. 
It is also essential that the time be absent in the Lagrangian explic- 
itly, i.e. that the system be conservative. We have thus established 
that for a closed system in which only conservative forces act, the 
quantity determined by formula (5.1) remains constant. This is why 
quantity (5.1) is known as the total energy of a system. 

The functions of the quantities $q_k$ and $\dot q_k$ that in the motion of
a system retain a constant value determined by the initial conditions
are called integrals of motion in mechanics. Consequently, the energy
of a conservative closed system can be said to be an integral of
motion.

Let us find the conditions in which the definition (5.1) transforms
into (5.2). To do this, we shall investigate the expression for $T$ in
generalized coordinates. We assume that the constraints are holonomic.
We express the Cartesian coordinates $x_i$ of the system’s particles
in terms of the generalized coordinates $q_k$:

$$x_i = x_i(q_1, q_2, \ldots, q_s, t) \quad (i = 1, 2, \ldots, n) \tag{5.4}$$


We must note that the time is contained explicitly in these relations 
only for non-stationary constraints. 

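We find the total time derivatives of the functions $x_i$. Since the
quantities $q_k$ are functions of $t$, the required derivatives have the
form

$$\dot x_i = \frac{dx_i}{dt} = \frac{\partial x_i}{\partial t} + \sum_k \frac{\partial x_i}{\partial q_k}\,\dot q_k \quad (i = 1, 2, \ldots, n)$$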


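We introduce these expressions into the formula for the kinetic
energy:

$$T = \sum_i \frac{m_i \dot x_i^2}{2} = \frac{1}{2}\sum_i m_i \left(\frac{\partial x_i}{\partial t}\right)^2 + \sum_i m_i \frac{\partial x_i}{\partial t} \sum_k \frac{\partial x_i}{\partial q_k}\,\dot q_k + \frac{1}{2}\sum_i m_i \left(\sum_k \frac{\partial x_i}{\partial q_k}\,\dot q_k\right)^2 \tag{5.5}$$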


In accordance with (5.4), the expressions $\partial x_i/\partial t$ and
$\partial x_i/\partial q_k$ are functions of $q_k$ and $t$; they do not
contain the quantities $\dot q_k$. Consequently, the first of the sums in
formula (5.5) is also a function of $q_k$ and $t$. Designating this
function by the letter $\alpha$, we have

$$\alpha(q_j, t) = \frac{1}{2}\sum_i m_i \left(\frac{\partial x_i}{\partial t}\right)^2 \tag{5.6}$$


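Changing the sequence of summation over $i$ and $k$ in the second
of the sums in formula (5.5), we give it the form

$$\sum_k \left\{\sum_i m_i \frac{\partial x_i}{\partial t}\frac{\partial x_i}{\partial q_k}\right\}\dot q_k = \sum_k \beta_k(q_j, t)\,\dot q_k$$

where

$$\beta_k(q_j, t) = \sum_i m_i \frac{\partial x_i}{\partial t}\frac{\partial x_i}{\partial q_k} \tag{5.7}$$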

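Finally, representing one of the factors in the third term of (5.5)
in the form $\sum_k \frac{\partial x_i}{\partial q_k}\dot q_k$ and the other
factor in the form $\sum_l \frac{\partial x_i}{\partial q_l}\dot q_l$,
and then changing the sequence of summation over $i$, $k$, and $l$,
we write the third term as follows:

$$\frac{1}{2}\sum_i m_i \sum_k \frac{\partial x_i}{\partial q_k}\,\dot q_k \sum_l \frac{\partial x_i}{\partial q_l}\,\dot q_l = \sum_{k,l}\left\{\frac{1}{2}\sum_i m_i \frac{\partial x_i}{\partial q_k}\frac{\partial x_i}{\partial q_l}\right\}\dot q_k \dot q_l = \sum_{k,l} \gamma_{kl}(q_j, t)\,\dot q_k \dot q_l$$

where

$$\gamma_{kl}(q_j, t) = \frac{1}{2}\sum_i m_i \frac{\partial x_i}{\partial q_k}\frac{\partial x_i}{\partial q_l} \tag{5.8}$$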


The expression for the kinetic energy can thus be written as

$$T = \alpha(q_j, t) + \sum_k \beta_k(q_j, t)\,\dot q_k + \sum_{k,l} \gamma_{kl}(q_j, t)\,\dot q_k \dot q_l \tag{5.9}$$

With stationary constraints, $t$ is not contained explicitly in
function (5.4). Consequently, $\partial x_i/\partial t = 0$, so that the
coefficients $\alpha(q_j, t)$ and $\beta_k(q_j, t)$ vanish. In this case,
the coefficients $\gamma_{kl}$ do not contain the time explicitly. The
result is

$$T = \sum_{k,l} \gamma_{kl}(q_j)\,\dot q_k \dot q_l \tag{5.10}$$

Therefore, the kinetic energy of a system with stationary constraints
is a homogeneous quadratic function of the generalized velocities
$\dot q_k$.

We established in Sec. 3 that for a system of particles, the Lagrangian
in Cartesian coordinates is

$$L(x_i, \dot x_i, t) = \sum_i \frac{m_i \dot x_i^2}{2} - U(x_i, t)$$

In the transition from Cartesian to generalized coordinates, the 
first term on the right-hand side of the above equation transforms, 
generally speaking, into expression (5.9). If we limit ourselves to 
a consideration of stationary constraints, however, the first term 
must be replaced with expression (5.10). The second term becomes 
$U(q_k, t)$. With stationary constraints, the time is contained
explicitly in $U$ provided that the potential forces are non-stationary.

We have thus established that for stationary constraints the
Lagrangian of a particle system written in generalized coordinates is

$$L(q_k, \dot q_k, t) = T(q_k, \dot q_k) - U(q_k, t)$$

where $T$ is determined by formula (5.10). Using this expression in
formula (5.1), we get

$$E = \sum_k \frac{\partial T}{\partial \dot q_k}\,\dot q_k - T(q_k, \dot q_k) + U(q_k, t) \tag{5.11}$$

In accordance with (5.10), $T$ is a homogeneous quadratic function of
the quantities $\dot q_k$. Consequently, the following relation holds for
it on the basis of Euler’s theorem (see Appendix II):

$$\sum_k \frac{\partial T}{\partial \dot q_k}\,\dot q_k = 2T$$


Making such a substitution in (5.11), we arrive at the formula

$$E = 2T - T + U = T(q_k, \dot q_k) + U(q_k, t)$$

We obtained this formula with only a single assumption, namely,
that the constraints are stationary. If, in addition, the potential
forces are also stationary, the potential $U(q_k, t)$ transforms into the
potential energy $U(q_k)$, and formula (5.1) transforms into (5.2).

Definition (5.1) thus coincides with (5.2) when both the constraints
and the potential forces are stationary.


6. Examples of Compiling Lagrange’s Equations

Having selected the independent generalized coordinates convenient
for describing the system being considered, we must establish
the form of the functions

$$x_i = x_i(q_k, t) \quad \text{and} \quad \dot x_i = \dot x_i(q_k, \dot q_k, t)$$

and introduce them instead of the quantities $x_i$ and $\dot x_i$ into the
expression for the Lagrangian

$$L = T(\dot x_i) - U(x_i, t) = \sum_i \frac{m_i \dot x_i^2}{2} - U(x_i, t)$$

The result is the Lagrangian in generalized coordinates:

$$L = L(q_k, \dot q_k, t) = T(q_k, \dot q_k, t) - U(q_k, t)$$

If the expression obtained for $L$ contains terms that do not depend
on $q_k$ and $\dot q_k$, they may be discarded because they do not
contribute to the quantities $\partial L/\partial q_k$ and
$\partial L/\partial \dot q_k$ and, consequently, do not affect
the form of Lagrange’s equations.

We sometimes succeed in greatly simplifying the operation of
finding the function $T(q_k, \dot q_k, t)$. This is possible when it is
easy to establish the relation between a particle’s elementary
displacement $ds$ and the increments of the generalized coordinates
$q_k$. For instance, in polar coordinates on a plane (Fig. 6.1), the
displacement $ds$ is a diagonal of a rectangle constructed on the sides
$dr$ and $r\,d\varphi$ (we must remember the smallness of $d\varphi$).
Consequently, $ds^2 = dr^2 + r^2\,d\varphi^2$. Dividing this quantity by
$dt^2$, we obtain the square of the velocity of a particle:
$v^2 = \dot r^2 + r^2\dot\varphi^2$. Finally,

$$T = \frac{1}{2} m v^2 = \frac{1}{2} m \left(\dot r^2 + r^2 \dot\varphi^2\right) \tag{6.1}$$

Fig. 6.1. Fig. 6.2.

For cylindrical coordinates, a third coordinate $z$ is added to the
previous two ($r$ and $\varphi$). The displacement $ds$ is a diagonal of a
rectangular parallelepiped with the sides $dr$, $r\,d\varphi$, and $dz$.
Hence,

$$T = \frac{1}{2} m \left(\dot r^2 + r^2 \dot\varphi^2 + \dot z^2\right) \tag{6.2}$$

Three mutually perpendicular segments (Fig. 6.2) of lengths
$dr$, $r\,d\vartheta$, $r\sin\vartheta\,d\varphi$ (the last segment is
directed beyond the drawing¹) correspond to the increments of the polar
coordinates $r$, $\vartheta$, $\varphi$. The displacement $ds$ coincides
with a diagonal of a rectangular parallelepiped constructed on these
segments. Therefore,

$$T = \frac{1}{2} m \left(\dot r^2 + r^2 \dot\vartheta^2 + r^2 \sin^2\vartheta\,\dot\varphi^2\right) \tag{6.3}$$

1 We shall depict segments perpendicular to the plane of the drawing by a 
circle with a dot if the segment is directed towards us, and by a circle with a 
cross if the segment is directed behind the drawing. 
Let us consider several examples.

1. A Simple Pendulum (see Fig. 1.1). The displacement of the
particle $m$ is $ds = l\,d\varphi$. Hence,
$T = \frac{1}{2} m l^2 \dot\varphi^2$. The potential energy
is $U = -mgl\cos\varphi$. Consequently, the Lagrangian is

$$L = \frac{1}{2} m l^2 \dot\varphi^2 + mgl\cos\varphi$$

We invite our reader to write Lagrange’s equation and convince 
himself that it coincides with Eq. (1.1). 

We find the generalized momentum and generalized force. By
formula (4.19),

$$p_\varphi = \frac{\partial L}{\partial \dot\varphi} = m l^2 \dot\varphi = mvl$$

Hence, in the given case, the generalized momentum coincides with 
the moment of the conventional momentum mv (the angular momen- 
tum) relative to the point of suspension of the pendulum. 

By formula (4.20),

$$Q_\varphi = \frac{\partial L}{\partial \varphi} = -mgl\sin\varphi$$

and it is exactly the moment $N$ of the force $mg$ relative to the point
of suspension. We must note that the elementary work is

$$dA = Q_\varphi\,d\varphi = N\,d\varphi$$

The last expression coincides with that for the work in rotation 
known from elementary mechanics. 

Finally, we find the energy of the pendulum by formula (5.1) (we
can do this in the given case because the constraint is stationary):

$$E = \frac{\partial L}{\partial \dot\varphi}\,\dot\varphi - L = m l^2 \dot\varphi^2 - \frac{1}{2} m l^2 \dot\varphi^2 - mgl\cos\varphi = \frac{1}{2} m (l\dot\varphi)^2 - mgl\cos\varphi$$
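The energy calculation for the simple pendulum can be checked
numerically. The following is a minimal sketch, not part of the original
text: the function names `energy_from_lagrangian` and
`pendulum_lagrangian`, the parameter values, and the use of a central
finite difference for $\partial L/\partial\dot\varphi$ are all
assumptions made for illustration. It evaluates definition (5.1) for one
coordinate and compares the result with $T + U$.

```python
import math


def energy_from_lagrangian(lagrangian, q, qdot, t, h=1e-5):
    """Evaluate E = (dL/d qdot) * qdot - L, formula (5.1), for one coordinate.

    The derivative dL/d qdot is approximated by a central difference.
    """
    p = (lagrangian(q, qdot + h, t) - lagrangian(q, qdot - h, t)) / (2.0 * h)
    return p * qdot - lagrangian(q, qdot, t)


def pendulum_lagrangian(phi, phidot, t, m=1.0, l=1.0, g=9.81):
    """L = (1/2) m l^2 phidot^2 + m g l cos(phi) for the simple pendulum."""
    return 0.5 * m * l**2 * phidot**2 + m * g * l * math.cos(phi)


phi, phidot = 0.3, 1.5  # illustrative state (assumed values)
e_from_l = energy_from_lagrangian(pendulum_lagrangian, phi, phidot, 0.0)
e_t_plus_u = 0.5 * 1.0 * 1.0**2 * phidot**2 - 1.0 * 9.81 * 1.0 * math.cos(phi)
print(e_from_l, e_t_plus_u)
```

Because the constraint and the potential force are stationary here, the
two printed numbers should agree to within the finite-difference error.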

2. A Pendulum with a Uniformly Moving Point of Suspension.
Assume that the point of suspension of a simple pendulum moves in
a horizontal direction at the constant velocity $v$ in the plane of
oscillations of the pendulum (Fig. 6.3). The equation of the
constraint is

$$f(x, y, t) = (x - vt)^2 + y^2 - l^2 = 0$$

(the constraint is non-stationary).

From the expressions for the Cartesian coordinates

$$x = l\sin\varphi + vt, \qquad y = l\cos\varphi$$

it follows that

$$\dot x = (l\cos\varphi)\,\dot\varphi + v, \qquad \dot y = -(l\sin\varphi)\,\dot\varphi$$

whence

$$T = \frac{1}{2} m \left[l^2 \dot\varphi^2 + (2vl\cos\varphi)\,\dot\varphi + v^2\right]$$

The potential energy is $U = -mgy = -mgl\cos\varphi$.

In compiling the Lagrangian, the constant term $\frac{1}{2} m v^2$ may be
discarded. Consequently,

$$L = \frac{1}{2} m \left[l^2 \dot\varphi^2 + (2vl\cos\varphi)\,\dot\varphi\right] + mgl\cos\varphi$$

Notwithstanding the non-stationary nature of the constraint, the 
time does not enter the function L explicitly. 



We invite our reader to find Lagrange’s equations in this and the 
following examples. 
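For the pendulum with a uniformly moving point of suspension, the
calculation can be sketched as follows. This is a worked check added
for illustration, using only the Lagrangian of Example 2 and Lagrange’s
equation (4.16) for the single coordinate $\varphi$.

```latex
\begin{aligned}
\frac{\partial L}{\partial \dot\varphi} &= m l^2 \dot\varphi + m v l \cos\varphi, \\
\frac{d}{dt}\frac{\partial L}{\partial \dot\varphi} &= m l^2 \ddot\varphi - m v l\,\dot\varphi \sin\varphi, \\
\frac{\partial L}{\partial \varphi} &= -m v l\,\dot\varphi \sin\varphi - m g l \sin\varphi, \\
\frac{d}{dt}\frac{\partial L}{\partial \dot\varphi} - \frac{\partial L}{\partial \varphi}
  &= m l^2 \ddot\varphi + m g l \sin\varphi = 0 .
\end{aligned}
```

The terms containing $v$ cancel, so the equation of motion is the same
as for a pendulum with a fixed point of suspension. This is consistent
with the fact that the term $m v l\,\dot\varphi\cos\varphi$ in $L$ equals
the total time derivative of $m v l \sin\varphi$.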

3. A Pendulum with a Point of Suspension Moving with Constant
Acceleration. Consider a simple pendulum whose point of suspension
$S$ moves with the constant acceleration $a$ along a straight line
making the angle $\alpha$ with a horizontal plane (Fig. 6.4). The
coordinates of the point of suspension are

$$x_S = \left(\frac{1}{2} a\cos\alpha\right) t^2, \qquad y_S = \left(\frac{1}{2} a\sin\alpha\right) t^2$$

and the coordinates of point $m$ are

$$x = \left(\frac{1}{2} a\cos\alpha\right) t^2 + l\sin\varphi, \qquad y = \left(\frac{1}{2} a\sin\alpha\right) t^2 + l\cos\varphi$$

Differentiation of $x$ and $y$ with respect to $t$ yields

$$\dot x = (a\cos\alpha)\,t + (l\cos\varphi)\,\dot\varphi, \qquad \dot y = (a\sin\alpha)\,t - (l\sin\varphi)\,\dot\varphi$$

Introducing these values of $\dot x$ and $\dot y$ into the expression for
the kinetic energy, we obtain

$$T = \frac{1}{2} m a^2 t^2 + mal\,(\cos\alpha\cos\varphi - \sin\alpha\sin\varphi)\,\dot\varphi\,t + \frac{1}{2} m l^2 \dot\varphi^2$$

The potential energy is

$$U = -mgy = -mg\left[\left(\frac{1}{2} a\sin\alpha\right) t^2 + l\cos\varphi\right]$$

In writing the expression for the Lagrangian, we may omit the terms
$\frac{1}{2} m a^2 t^2$ in $T$ and $-\left(\frac{1}{2} mga\sin\alpha\right) t^2$
in $U$ because they do not contain $\varphi$ and $\dot\varphi$ and
therefore cannot affect the form of Lagrange’s equations. As a result,
we find that

$$L = mal\,(\cos\alpha\cos\varphi - \sin\alpha\sin\varphi)\,\dot\varphi\,t + \frac{1}{2} m l^2 \dot\varphi^2 + mgl\cos\varphi$$

The Lagrangian depends explicitly 
on t (this is due to the non-stationary 
nature of the constraint), the time 
having entered L through the agency 
of the kinetic energy T. 

4. A Particle Moving along a Uniformly Rotating Straight Line. Assume
that a particle of mass $m$ experiences a non-stationary constraint
consisting in that the particle can move only along a straight line
rotating at a constant angular velocity ($\varphi = \omega t$) in a
vertical plane (Fig. 6.5). In addition to the reaction of the constraint,
the particle is acted upon by the potential force $mg$. The Cartesian
coordinates of the particle are

$$x = r\cos\omega t, \qquad y = r\sin\omega t$$

Consequently,

$$\dot x = \dot r\cos\omega t - r\omega\sin\omega t, \qquad \dot y = \dot r\sin\omega t + r\omega\cos\omega t$$

The kinetic energy is

$$T = \frac{1}{2} m \left(\dot r^2 + r^2 \omega^2\right)$$

The potential energy is

$$U = mgy = mgr\sin\omega t$$

(when we passed from the Cartesian coordinates to the generalized
coordinate $r$, the time $t$ entered the expression for $U$ explicitly).
Finally, the Lagrangian is

$$L = \frac{1}{2} m \left(\dot r^2 + r^2 \omega^2\right) - mgr\sin\omega t$$

It depends explicitly on $t$; now the time has entered $L$ in terms
of the potential energy $U$.

The generalized momentum is

$$p_r = \frac{\partial L}{\partial \dot r} = m\dot r = mv$$

where $v$ is the particle’s velocity along the straight line.

The generalized force

$$Q_r = \frac{\partial L}{\partial r} = mr\omega^2 - mg\sin\omega t$$

consists of two terms. The first of them, $mr\omega^2$, is the centrifugal
force of inertia, and the second, $-mg\sin\omega t$, is the projection of
the force $mg$ onto the direction $r$.
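Example 4 also gives a convenient numerical illustration of formula
(5.3). With no non-potential forces ($W^* = 0$), the quantity (5.1) for
this Lagrangian is
$E = \frac{1}{2} m\dot r^2 - \frac{1}{2} m r^2\omega^2 + mgr\sin\omega t$,
and (5.3) predicts $dE/dt = -\partial L/\partial t = mgr\omega\cos\omega t$.
The sketch below is an assumption-laden illustration rather than part of
the original text: the function name `energy_balance`, the parameter
values, and the fourth-order Runge-Kutta integrator are all chosen here
for convenience. It integrates $\ddot r = r\omega^2 - g\sin\omega t$ and
compares the change of $E$ with the time integral of
$-\partial L/\partial t$.

```python
import math


def energy_balance(m=1.0, g=9.81, omega=2.0, r0=1.0, v0=0.0, t_end=1.0, n=2000):
    """Return (change of E, integral of -dL/dt) for the rotating-line particle."""

    def accel(r, t):
        # Lagrange's equation: m r'' = m r omega^2 - m g sin(omega t)
        return r * omega**2 - g * math.sin(omega * t)

    def energy(r, v, t):
        # E = p_r * r' - L, formula (5.1)
        return 0.5 * m * v * v - 0.5 * m * r * r * omega**2 + m * g * r * math.sin(omega * t)

    def minus_dl_dt(r, t):
        # -dL/dt (partial derivative at fixed r and r')
        return m * g * r * omega * math.cos(omega * t)

    h = t_end / n
    r, v, t = r0, v0, 0.0
    e_start = energy(r, v, t)
    integral = 0.0
    for _ in range(n):
        f_old = minus_dl_dt(r, t)
        # one Runge-Kutta (RK4) step for the pair (r, v)
        k1r, k1v = v, accel(r, t)
        k2r, k2v = v + 0.5 * h * k1v, accel(r + 0.5 * h * k1r, t + 0.5 * h)
        k3r, k3v = v + 0.5 * h * k2v, accel(r + 0.5 * h * k2r, t + 0.5 * h)
        k4r, k4v = v + h * k3v, accel(r + h * k3r, t + h)
        r += h * (k1r + 2.0 * k2r + 2.0 * k3r + k4r) / 6.0
        v += h * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        t += h
        # trapezoidal rule for the integral of -dL/dt along the trajectory
        integral += 0.5 * h * (f_old + minus_dl_dt(r, t))
    return energy(r, v, t) - e_start, integral


delta_e, integral = energy_balance()
print(delta_e, integral)
```

The two printed numbers should agree closely, while neither is zero:
the quantity $E$ is not conserved here because the Lagrangian contains
the time explicitly.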

5. A Particle Moving along a Straight Line Rotating with Acceleration.
Assume that the straight line giving rise to a constraint (see
Fig. 6.5) rotates not uniformly, but with acceleration
($\varphi = \alpha t^2$). Hence,

$$x = r\cos\alpha t^2, \qquad y = r\sin\alpha t^2$$

$$\dot x = \dot r\cos\alpha t^2 - r\,2\alpha t\sin\alpha t^2, \qquad \dot y = \dot r\sin\alpha t^2 + r\,2\alpha t\cos\alpha t^2$$

Accordingly,

$$T = \frac{1}{2} m \left(\dot r^2 + r^2\,4\alpha^2 t^2\right)$$

$$U = mgy = mgr\sin\alpha t^2$$

$$L = \frac{1}{2} m \left(\dot r^2 + r^2\,4\alpha^2 t^2\right) - mgr\sin\alpha t^2$$

In this example, the time entered $L$ explicitly in terms of both $T$
and $U$.


7. Principle of Least Action¹

Instead of Newton’s laws, mechanics can be based on the principle
of least action or the Hamilton principle². The action $S$ during the
time interval from $t_1$ to $t_2$ is defined to be the integral

$$S = \int_{t_1}^{t_2} L(q_k, \dot q_k, t)\,dt \tag{7.1}$$

where $L(q_k, \dot q_k, t)$ is the Lagrangian of the system being
considered. Integration is performed from the instant $t_1$ at which the
position of the system is characterized by the values of the coordinates
$q_k(t_1)$ to the instant $t_2$ at which the position of the system is
determined by the values of the coordinates $q_k(t_2)$.

1 Before beginning to study this section, the reader should acquaint
himself with Appendix III.

2 This principle was established by the Irish mathematician William
Hamilton in 1834.

According to the principle of least action, a system moves between
the positions $q_k(t_1)$ and $q_k(t_2)$ in such a way [i.e. the functions
$q_k(t)$ have such a form] that the action (7.1) has the smallest possible
value. Using the notion of configuration space, we can say that a
point depicting the position of a system moves in this space along the
curve for which the action $S$ is minimum.

The Hamilton principle is the most general formulation of the
law of motion of mechanical systems. A merit of this formulation is
that it can easily be extended to systems that are not purely
mechanical, for instance to elastic media and electromagnetic fields.

Inspection of (7.1) shows that the dimension of action equals that
of the product of energy and time (or momentum and displacement).
Planck’s constant, also called a quantum of action, has the same
dimension.

It is a simple matter to obtain Lagrange’s equations from the
principle of least action. The action $S$ is a functional (see Appendix III).
According to the calculus of variations, a functional reaches an
extreme value provided that its variation is zero. Consequently,
the principle of least action can be expressed by the condition

$$\delta S = \delta \int_{t_1}^{t_2} L(q_k, \dot q_k, t)\,dt = 0 \tag{7.2}$$

For this reason, the principle of least action is known as the
variational principle of mechanics.

It is known from the calculus of variations (see Appendix III)
that the variation of a functional of the type (7.1)¹ vanishes if we
take functions satisfying Euler’s equations (III.25) as $q_k(t)$. In the
given case, Euler’s equations have the form

$$\frac{\partial L}{\partial q_k} - \frac{d}{dt}\frac{\partial L}{\partial \dot q_k} = 0 \quad (k = 1, 2, \ldots, s) \tag{7.3}$$

i.e. coincide with Lagrange’s equations.

We have thus convinced ourselves that the Hamilton principle
leads to Lagrange’s equations.

1 Compare with (III.20). In the given case, the role of $f(x, y_k, y_k')$
is played by $L(t, q_k, \dot q_k)$. The role of the independent variable
$x$ is played by $t$, that of $y_k(x)$ is played by $q_k(t)$, and that of
$y_k'(x)$, by the function $\dot q_k(t)$.

Appendix III shows [see the text following formula (III.19)] that
the addition to the integrand in (7.2) of the total derivative with 
respect to t of any function of the generalized coordinates and time 
does not change the conditions of the extremum, i.e. Eqs. (7.3). 
Consequently, the Lagrangian must be determined to within the 
additive summands that are the total time derivative of an arbitrary 
function of the generalized coordinates and time (a constant is 
a particular case of such a function). We have already noted this 
circumstance in Sec. 4 and used it in Sec. 6. 
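The statement that the true motion minimizes the action can be
illustrated numerically. The sketch below is not from the original text
and rests on stated assumptions: a single particle in a uniform
gravitational field with $L = \frac{1}{2} m\dot y^2 - mgy$ ($y$ measured
upwards), fixed end points $y(0) = y(t_{end}) = 0$, a simple
discretization of the integral (7.1), and hypothetical function names
`action`, `true_path`, and `perturbed`. It compares the action on the
true parabolic path with the action on paths deformed by
$\varepsilon\sin(\pi t/t_{end})$, which leaves the end points unchanged.

```python
import math


def action(path, t_end, m=1.0, g=9.81):
    """Discretized action (7.1) for L = (1/2) m y'^2 - m g y on a uniform time grid."""
    n = len(path) - 1
    h = t_end / n
    s = 0.0
    for i in range(n):
        v = (path[i + 1] - path[i]) / h          # velocity on the segment
        y_mid = 0.5 * (path[i] + path[i + 1])    # mean height on the segment
        s += (0.5 * m * v * v - m * g * y_mid) * h
    return s


def true_path(n, t_end, g=9.81):
    """Solution of y'' = -g with y(0) = y(t_end) = 0, sampled at n + 1 points."""
    return [0.5 * g * t * (t_end - t) for t in (t_end * i / n for i in range(n + 1))]


def perturbed(path, eps):
    """Add eps * sin(pi * t / t_end); the end points stay (numerically) fixed."""
    n = len(path) - 1
    return [y + eps * math.sin(math.pi * i / n) for i, y in enumerate(path)]


t_end, n = 1.0, 200
base = true_path(n, t_end)
s_true = action(base, t_end)
for eps in (-0.2, 0.05, 0.3):
    print(eps, action(perturbed(base, eps), t_end) - s_true)
```

Each printed difference should be positive: for this Lagrangian the
deformation changes the action only through the positive term
$\frac{1}{2} m \int (\delta\dot y)^2\,dt$.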
CONSERVATION LAWS


8. Energy Conservation


The conservation laws considered in mechanics are based on the
properties of space and time. Energy conservation is associated
with the homogeneity of time, momentum conservation with the
homogeneity of space, and, finally, angular momentum conservation
with the isotropy of space¹.

We shall begin with the law of energy conservation. Assume that 
a system of particles is in unchanging external conditions (this occurs 
if the system is closed or experiences the action of a constant 
external force field); the constraints (if present) are ideal and station- 
ary. In this case, the time, owing to its homogeneity, cannot enter 
the Lagrangian explicitly. Indeed, homogeneity signifies the equi- 
valence of all instants. Therefore, the replacement of one instant 
with another without changing the values of the coordinates and 
velocities of the particles should not alter the mechanical properties 
of the system. This is naturally true only if the replacement of one 
instant with another does not change the conditions in which the 
system is, i.e. if the external field is time independent (particular- 
ly, this field may be absent). 

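Hence, for a closed system or one in a stationary force field, we
have $\partial L/\partial t = 0$. Therefore,

$$\frac{dL}{dt} = \sum_k \frac{\partial L}{\partial q_k}\,\dot q_k + \sum_k \frac{\partial L}{\partial \dot q_k}\,\ddot q_k \tag{8.1}$$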


1 Homogeneity signifies identical properties at all points. Isotropy signifies
identical properties at each point in all directions. Homogeneity and isotropy
are independent of each other. A medium may have different properties at
different points, the properties at one point differing from those at others, but
being the same in all directions. Such a medium will be non-homogeneous, but
isotropic. A medium is possible whose properties are the same at all points, but
differ (in the same way at all points) in different directions. Such a medium will
be homogeneous, but anisotropic. It is quite obvious that there may be
homogeneous and isotropic media, and also non-homogeneous and anisotropic ones.

If a system is conservative, the motion of the particles obeys
Eqs. (4.16). In accordance with these equations, we shall substitute
$\frac{d}{dt}\frac{\partial L}{\partial \dot q_k}$ for
$\frac{\partial L}{\partial q_k}$. Expression (8.1) can therefore be
written as follows:

$$\frac{dL}{dt} = \sum_k \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_k}\right)\dot q_k + \sum_k \frac{\partial L}{\partial \dot q_k}\frac{d\dot q_k}{dt} = \frac{d}{dt}\sum_k \frac{\partial L}{\partial \dot q_k}\,\dot q_k$$

The latter expression can be given the form

$$\frac{d}{dt}\left(\sum_k \frac{\partial L}{\partial \dot q_k}\,\dot q_k - L\right) = 0$$

According to formula (5.1), the quantity in parentheses is the energy
$E$ of the system. We have thus arrived at the statement that
$dE/dt = 0$, whence

$$E = \text{const} \tag{8.2}$$


Hence, the homogeneity of time leads to the following law: the 
energy of a closed conservative system of particles (or a system in a station- 
ary external force field) remains constant. 

Inspection of (5.1) shows that $E$ is a function of the generalized
coordinates $q_k$ and the generalized velocities $\dot q_k$. Functions of
the quantities $q_k$ and $\dot q_k$ that retain in motion a constant value
determined by the initial conditions are called integrals of motion.
Consequently, the energy of a closed system is an integral of motion.


9. Momentum Conservation

Consider a closed system of particles. Closure signifies that the
action of external bodies on the system’s particles is negligibly
small. Owing to the homogeneity of space, the displacement of all
the particles of the system through an identical length $\delta\mathbf r$
must not change the mechanical properties of the system: the Lagrangian
must retain its previous value. For an unclosed system, such a
translation would cause a change in the arrangement of the particles
relative to the bodies interacting with them, which would affect the
system’s mechanical properties. Therefore, we may state only for
a closed system of particles that the parallel translation of the system
as a whole is not attended by a change in the function $L$ (i.e.
$\delta L = 0$).

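Assuming the displacement $\delta\mathbf r$ to be very small, we can write¹

$$\delta L = \sum_a \frac{\partial L}{\partial \mathbf r_a}\,\delta\mathbf r_a = \delta\mathbf r \sum_a \frac{\partial L}{\partial \mathbf r_a} \tag{9.1}$$

($a$ is the number of the particle). We took advantage of the
circumstance that the displacements $\delta\mathbf r_a$ of the particles
are the same and equal $\delta\mathbf r$.

According to our assumption, $\delta\mathbf r \neq 0$. It thus follows
from (9.1) that

$$\sum_a \frac{\partial L}{\partial \mathbf r_a} = 0 \tag{9.2}$$

1 See the footnote on p. 17. In accordance with what is said in this
footnote,
$\frac{\partial L}{\partial \mathbf r}\,d\mathbf r = \frac{\partial L}{\partial x}\,dx + \frac{\partial L}{\partial y}\,dy + \frac{\partial L}{\partial z}\,dz$.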

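Lagrange’s equations (3.8) allow us to write

$$\frac{\partial L}{\partial x_a} = \frac{d}{dt}\frac{\partial L}{\partial \dot x_a} = \frac{d}{dt}\frac{\partial L}{\partial v_{ax}}$$

$$\frac{\partial L}{\partial y_a} = \frac{d}{dt}\frac{\partial L}{\partial \dot y_a} = \frac{d}{dt}\frac{\partial L}{\partial v_{ay}}$$

$$\frac{\partial L}{\partial z_a} = \frac{d}{dt}\frac{\partial L}{\partial \dot z_a} = \frac{d}{dt}\frac{\partial L}{\partial v_{az}}$$

Multiplying the first, second, and third of these equations by the
unit vectors $\mathbf e_x$, $\mathbf e_y$, and $\mathbf e_z$, respectively¹,
and summating them, we obtain the expression

$$\frac{\partial L}{\partial \mathbf r_a} = \frac{d}{dt}\frac{\partial L}{\partial \mathbf v_a} \tag{9.3}$$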


Equation (9.2) can thus be written as

$$\frac{d}{dt}\sum_a \frac{\partial L}{\partial \mathbf v_a} = 0 \tag{9.4}$$

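The quantity $\frac{\partial L}{\partial \mathbf v_a}$ is a vector with the
components
$\frac{\partial L}{\partial v_{ax}} = \frac{\partial L}{\partial \dot x_a}$,
$\frac{\partial L}{\partial v_{ay}} = \frac{\partial L}{\partial \dot y_a}$,
and
$\frac{\partial L}{\partial v_{az}} = \frac{\partial L}{\partial \dot z_a}$.
According to (4.17), these derivatives are the projections of the
conventional (not generalized) momentum $\mathbf p_a$ of the $a$-th
particle onto the coordinate axes. Hence,

$$\frac{\partial L}{\partial \mathbf v_a} = \mathbf p_a \tag{9.5}$$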

With account taken of this circumstance, Eq. (9.4) will be written
as follows:

$$\frac{d}{dt}\sum_a \mathbf p_a = 0$$

Hence it follows that

$$\mathbf p = \sum_a \mathbf p_a = \text{const} \tag{9.6}$$

where $\mathbf p$ is the total momentum of the system.

1 Instead of the symbols i, j, k, we shall designate the unit vectors of the
coordinate axes by the symbol e with the relevant subscript.

Proceeding from the homogeneity of space, we have thus arrived 
at the following law: the total momentum of a closed system of parti- 
cles remains constant. Consequently, the momentum of a closed sys- 
tem is also an integral of motion. 
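Momentum conservation for a closed system can be illustrated with a
small numerical experiment. The sketch below is an illustration under
stated assumptions, not a construction from the original text: two
particles move along one axis and interact through the potential
$U = \frac{1}{2} k (x_1 - x_2)^2$, which depends only on their relative
position, so a translation of the system as a whole leaves $L$
unchanged. The function name `simulate_pair`, the parameter values, and
the velocity Verlet integration scheme are choices made here.

```python
def simulate_pair(m1=1.0, m2=3.0, k=2.0, x1=0.0, x2=1.5, v1=0.4, v2=-0.1,
                  dt=1e-3, steps=5000):
    """Integrate two interacting particles; return (initial, final) total momentum."""

    def force_on_1(a, b):
        # F_1 = -dU/dx_1 for U = k (x_1 - x_2)^2 / 2; the force on particle 2 is -F_1
        return -k * (a - b)

    p_start = m1 * v1 + m2 * v2
    f1 = force_on_1(x1, x2)
    for _ in range(steps):
        # velocity Verlet: half kick, drift, recompute force, half kick
        v1 += 0.5 * dt * f1 / m1
        v2 += 0.5 * dt * (-f1) / m2
        x1 += dt * v1
        x2 += dt * v2
        f1 = force_on_1(x1, x2)
        v1 += 0.5 * dt * f1 / m1
        v2 += 0.5 * dt * (-f1) / m2
    return p_start, m1 * v1 + m2 * v2


p_start, p_end = simulate_pair()
print(p_start, p_end)
```

The individual momenta change during the motion, but their sum should
stay constant up to rounding error, in agreement with (9.6).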

10. Angular Momentum Conservation 

Owing to the isotropy of space, the mechanical properties of
a closed system of particles should not change upon the arbitrary
rotation of the system as a whole in space. Accordingly, the Lagrangian
should also remain unchanged ($\delta L = 0$). Let us find the increment
of the Lagrangian $\delta L$ in an arbitrary very small rotation of
a system through the angle $\delta\boldsymbol\varphi$. All the vectors
characterizing the system will rotate together with it. As a result,
they will receive certain increments that will be of the same order as
$\delta\varphi$. According to formula (VI.46),

$$\delta\mathbf r_a = [\delta\boldsymbol\varphi, \mathbf r_a], \qquad \delta\mathbf v_a = [\delta\boldsymbol\varphi, \mathbf v_a] \tag{10.1}$$

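Owing to the smallness of the quantities $\delta\mathbf r_a$ and
$\delta\mathbf v_a$,

$$\delta L = \sum_a \frac{\partial L}{\partial \mathbf r_a}\,\delta\mathbf r_a + \sum_a \frac{\partial L}{\partial \mathbf v_a}\,\delta\mathbf v_a$$

[we remind our reader that
$L = L(\mathbf r_a, \dot{\mathbf r}_a) = L(\mathbf r_a, \mathbf v_a)$, and
$L$ does not contain the time explicitly]. With a view to (10.1),

$$\delta L = \sum_a \frac{\partial L}{\partial \mathbf r_a}\,[\delta\boldsymbol\varphi, \mathbf r_a] + \sum_a \frac{\partial L}{\partial \mathbf v_a}\,[\delta\boldsymbol\varphi, \mathbf v_a] \tag{10.2}$$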

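It is known from vector algebra that a cyclic transposition of the
multipliers may be performed in a scalar triple product [see formula
(VI.3)]. Such a transposition in (10.2) yields

$$\delta L = \sum_a \delta\boldsymbol\varphi \left[\mathbf r_a, \frac{\partial L}{\partial \mathbf r_a}\right] + \sum_a \delta\boldsymbol\varphi \left[\mathbf v_a, \frac{\partial L}{\partial \mathbf v_a}\right]$$

Let us put $\delta\boldsymbol\varphi$ outside the sum sign, simultaneously
substituting $\frac{d}{dt}\frac{\partial L}{\partial \mathbf v_a}$ for
$\frac{\partial L}{\partial \mathbf r_a}$ in accordance with Lagrange’s
equations (9.3):

$$\delta L = \delta\boldsymbol\varphi \left\{\sum_a \left[\mathbf r_a, \frac{d}{dt}\frac{\partial L}{\partial \mathbf v_a}\right] + \sum_a \left[\dot{\mathbf r}_a, \frac{\partial L}{\partial \mathbf v_a}\right]\right\} = \delta\boldsymbol\varphi\,\frac{d}{dt}\sum_a \left[\mathbf r_a, \frac{\partial L}{\partial \mathbf v_a}\right]$$

According to our assumption, $\delta\boldsymbol\varphi \neq 0$, therefore
the condition $\delta L = 0$ is equivalent to the condition

$$\frac{d}{dt}\sum_a \left[\mathbf r_a, \frac{\partial L}{\partial \mathbf v_a}\right] = 0$$

or

$$\sum_a \left[\mathbf r_a, \frac{\partial L}{\partial \mathbf v_a}\right] = \text{const}$$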

According to (9.5), $\partial L/\partial \mathbf v_a$ is the conventional
momentum $\mathbf p_a$. The quantity $\mathbf M = [\mathbf r, \mathbf p]$
is the angular momentum (moment of momentum) of the relevant particle
with respect to the origin of coordinates. We have thus arrived at the
statement that

$$\mathbf M = \sum_a \mathbf M_a = \sum_a [\mathbf r_a, \mathbf p_a] = \text{const} \tag{10.3}$$

In this relation, M a is the angular momentum of the a-th particle, 
and M is the resultant angular momentum of the system. 

With a view to the isotropy of space, we have thus arrived at the 
following law: the resultant angular momentum of a closed system of 
particles remains constant. Consequently, the angular momentum of 
a closed system, like its energy and momentum, is an integral of 
motion. 

Assume that a system of particles is in an external central force
field, i.e. a field in which the force acting on any of the particles
has a direction passing through the same fixed point $O$ (the centre
of the field), while the magnitude of the force depends only on the
distance $r$ to the particle from this point. The potential energy of
a particle in such a field is

$$U = U(r) \tag{10.4}$$

Arbitrary rotation of the system in space about the point $O$ does
not change the mechanical properties of the system (the arrangement
of the particles relative to the force centre $O$ remains constant
in such rotation). Hence, although in the given case the system of
particles is not closed, its angular momentum will be constant.
True, this holds only for the angular momentum with respect to
the point $O$. For a closed system, however, the angular momentum
with respect to any point is conserved.

If an external field has axial symmetry (this signifies that the 
potential energy of a particle depends only on the distance R to the 
particle from the given axis), the mechanical properties of the sys- 
tem will not change upon rotation about the axis of the field. There- 
fore, the angular momentum (moment of momentum) of the system 
relative to this axis will be constant (we remind our reader that the 
moment relative to an axis is defined as the projection onto this 
axis of a moment taken relative to any of the points on the axis). 
11. Motion of a Particle in a Central Force Field


Consider the motion of a particle in a central field of the kind

$$U(r) = \frac{\alpha}{r} \tag{11.1}$$

where $\alpha$ is a constant that may be either positive or negative. A
positive constant corresponds to repulsion of the particle from the force
centre (for instance, the Coulomb force of repulsion), and a negative
one to attraction of the particle to the centre (the Coulomb force of
attraction or the force of gravitational interaction of the particle
with a stationary particle at the centre of the field).

We established in Sec. 10 that the angular momentum of a particle
remains constant in a central field, i.e.

$$\mathbf M = [\mathbf r, \mathbf p] = \text{const}$$

A vector product is perpendicular to the plane containing the vectors
being multiplied. It follows that with a constant direction of the
vector $\mathbf M$, the vector $\mathbf r$ is always in one plane
perpendicular to $\mathbf M$, and the trajectory of the particle is a
plane curve. We shall determine the position of the particle with the aid
of the polar coordinates $r$ and $\varphi$, making the origin of
coordinates coincide with the centre of the field. In these coordinates,
the Lagrangian is [see formula (6.1)]

$$L = \frac{1}{2} m \left(\dot r^2 + r^2 \dot\varphi^2\right) - \frac{\alpha}{r}$$


The function $L$ does not contain the coordinate $\varphi$ explicitly.
Generalized coordinates not contained in a Lagrangian explicitly are
called cyclic. In the absence of non-potential forces, Lagrange’s
equations corresponding to cyclic coordinates are as follows:

$$\frac{d}{dt}\frac{\partial L}{\partial \dot q_k} = 0$$

Hence,

$$p_k = \frac{\partial L}{\partial \dot q_k} = \text{const} \tag{11.2}$$


Therefore, the generalized momenta corresponding to the cyclic
coordinates are constant, i.e. are integrals of motion.

In the problem we are considering, Eq. (11.2) has the form

$$p_\varphi = \frac{\partial L}{\partial \dot\varphi} = m r^2 \dot\varphi = M = \text{const} \tag{11.3}$$

We could have written this equation immediately, taking into
account that $m r^2 \dot\varphi$ is the angular momentum of the particle
relative to the origin of coordinates, which must be conserved in a
central force field.

The energy of a particle in a central field is also an integral of
motion. Consequently, calculations by formula (5.11) allow us to
write

$$E = \frac{1}{2} m \left(\dot r^2 + r^2 \dot\varphi^2\right) + \frac{\alpha}{r} = \text{const} \tag{11.4}$$

To find the trajectory of a particle, it is better to proceed from
Eqs. (11.3) and (11.4) than from Lagrange’s equations. This way is
simpler because Lagrange’s equations contain second derivatives
of the coordinates, whereas Eqs. (11.3) and (11.4) contain only the
first derivatives of the coordinates with respect to time.

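Excluding $\dot\varphi$ from Eqs. (11.3) and (11.4), we obtain

$$E = \frac{1}{2} m \dot r^2 + \frac{M^2}{2mr^2} + \frac{\alpha}{r}$$

whence

$$\dot r = \frac{dr}{dt} = \frac{1}{m}\sqrt{2mE - \frac{2\alpha m}{r} - \frac{M^2}{r^2}}$$

From Eq. (11.3),

$$\dot\varphi = \frac{d\varphi}{dt} = \frac{1}{m}\frac{M}{r^2}$$

Excluding $dt$ from the last two equations, we find that

$$d\varphi = \frac{(M/r^2)\,dr}{\sqrt{2mE - \dfrac{2\alpha m}{r} - \dfrac{M^2}{r^2}}} = \frac{(M/r^2)\,dr}{\sqrt{2mE + \left(\dfrac{\alpha m}{M}\right)^2 - \left(\dfrac{\alpha m}{M} + \dfrac{M}{r}\right)^2}}$$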

Introducing the notation

$$b^2 = 2mE + \left(\frac{\alpha m}{M}\right)^2, \qquad u = \frac{\alpha m}{M} + \frac{M}{r}$$

and noting that $du = -(M/r^2)\,dr$, we can write that

$$\varphi = -\int \frac{du}{\sqrt{b^2 - u^2}} = \cos^{-1}\frac{u}{b} + \varphi_0$$

where $\varphi_0$ is an integration constant.

Returning to our previous notation, we get an equation of a
particle’s trajectory in polar coordinates:

$$\varphi - \varphi_0 = \cos^{-1}\frac{\alpha m/M + M/r}{\sqrt{2mE + (\alpha m/M)^2}} = \cos^{-1}\frac{1 + (M^2/\alpha m)(1/r)}{\sqrt{1 + 2EM^2/m\alpha^2}} \tag{11.5}$$


Inspection of Eq. (11.5) shows that at a preset value of $r$, the
difference $\varphi - \varphi_0$ can have two values differing in their
sign (the cosine is an even function). It is thus a simple matter to
conclude that the curve described by Eq. (11.5) is symmetrical relative
to the straight line making the angle $\varphi_0$ with the axis from
which $\varphi$ is measured.

To reveal the nature of the curve described by Eq. (11.5), let us
introduce the notation

$$\frac{M^2}{m|\alpha|} = p \tag{11.6}$$

$$\sqrt{1 + \frac{2EM^2}{m\alpha^2}} = e \tag{11.7}$$

The equation of the trajectory thus becomes

$$\varphi - \varphi_0 = \cos^{-1}\frac{1 \pm p/r}{e}$$

or, after simple transformations,

$$r = \mp\frac{p}{1 - e\cos(\varphi - \varphi_0)} \tag{11.8}$$

(the upper sign corresponds to repulsion, and the lower to attraction 
of the particle to the centre of force). 

The equation we have obtained is one of a conic section (or conic;
see Appendix IV) with the focal parameter $p$ and the eccentricity $e$.
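The meaning of $p$ and $e$ can be checked on a concrete case. The
sketch below is an illustration with assumed numbers and a hypothetical
helper name `conic_parameters`; it is not taken from the original text.
For attraction ($\alpha < 0$) a particle is started at a turning point
($\dot r = 0$) at the distance $r_0$ with a purely tangential velocity
$v_0$ larger than the circular speed, so that $r_0$ is the smallest
distance from the centre. By (11.8) with the lower sign, the smallest
value of $r$ is reached when the cosine equals $-1$ and is $p/(1 + e)$,
so this quantity should reproduce $r_0$.

```python
import math


def conic_parameters(m, alpha, energy, ang_mom):
    """Return (p, e) according to formulas (11.6) and (11.7)."""
    p = ang_mom**2 / (m * abs(alpha))
    e = math.sqrt(1.0 + 2.0 * energy * ang_mom**2 / (m * alpha**2))
    return p, e


# Illustrative values (assumptions): attraction, alpha < 0.
m, alpha = 1.0, -1.0
r0, v0 = 1.0, 1.2                        # turning point, tangential velocity
ang_mom = m * r0 * v0                    # M = m r^2 (d phi / dt), formula (11.3)
energy = 0.5 * m * v0**2 + alpha / r0    # formula (11.4) with dr/dt = 0

p, e = conic_parameters(m, alpha, energy, ang_mom)
r_min = p / (1.0 + e)                    # smallest r given by (11.8), lower sign
print(p, e, r_min)
```

In this example the energy is negative and the eccentricity comes out
smaller than one, which corresponds to a closed (elliptical) orbit.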

Let us first consider repulsion ($\alpha > 0$). In this case, $U > 0$ so that
the total energy $E$ cannot be negative. Hence, by (11.7), we have
$e > 1$. Consequently, in repulsion, the particle’s trajectory can only
be a hyperbola. Taking the upper (minus) sign in (11.8), we get the
equation of the trajectory

$$r = \frac{-p}{1 - e\cos(\varphi - \varphi_0)}$$


The value of $\varphi_0$ is determined by the choice of the reference axis
for $\varphi$. If the angle $\varphi$ is measured from the axis of symmetry of the
curve (from the straight line passing through its foci), $r$ should not
change when the sign of $\varphi$ changes. This occurs only when $\varphi_0 = 0$ or
$\varphi_0 = \pi$. Assuming that $\varphi_0 = 0$, we get the equation

$$r = \frac{-p}{1 - e\cos\varphi}$$

coinciding with Eq. (IV.14) that describes the right-hand branch of
a hyperbola [provided that the origin of coordinates, i.e. the centre
of force, is placed at the outer (left-hand) focus of the hyperbola].

Assuming that $\varphi_0 = \pi$ and taking into account that
$\cos(\varphi - \pi) = -\cos\varphi$, we get the equation

$$r = \frac{-p}{1 + e\cos\varphi}$$

coinciding with Eq. (IV.13) that describes the left-hand branch of
a hyperbola [provided that the origin of coordinates is placed at the
outer (right-hand) focus of the hyperbola; Fig. 11.1].

Now let us consider attraction ($\alpha < 0$). The lower (plus) sign in
formula (11.8) corresponds to it. Hence, the equation of the
trajectory is

$$r = \frac{p}{1 - e\cos\varphi} \tag{11.9}$$

for $\varphi_0 = 0$, and

$$r = \frac{p}{1 + e\cos\varphi} \tag{11.10}$$

for $\varphi_0 = \pi$.

As shown in Appendix IV, both equations describe either an ellipse,
or one of the branches of a hyperbola, or a parabola [see Eqs. (IV.11)
and (IV.12)]. The value of $e$ determines which of these curves we
are dealing with.

In attraction, $U < 0$; consequently, the total energy $E$ may be
either positive or negative, and, in particular, it may be zero. It
follows from formula (11.7) that when $E > 0$, the eccentricity is
greater than one, and the trajectory will be a hyperbola. Equation (11.9)
gives the right-hand branch of a hyperbola, and Eq. (11.10), the
left-hand one. Unlike repulsion, the origin of coordinates, i.e. the
centre of force, is at the inner focus for the given branch (Fig. 11.2).¹

At $E = 0$, the eccentricity $e = 1$, and the trajectory will be a
parabola. This case occurs if a particle begins its motion from a state
of rest at infinity.

¹ The solid curves in Fig. 11.2 are depicted for the same value of the angular
momentum $M$ and, consequently, for the same value of the focal parameter $p$.
The dashed ellipse corresponds to a smaller value of $M$. For a smaller $M$, the
vertex of a parabola may be to the right of that of a hyperbola corresponding to
a greater $M$.

Finally, at $E < 0$, the eccentricity is less than one, and the
trajectory will be an ellipse. In this case, the curves described by Eqs.
(11.9) and (11.10) differ in the position of the centre of force. Curve
(11.9) is obtained if the centre of force (the origin of coordinates)
is at the left-hand focus of the ellipse. Curve (11.10) corresponds to
the centre of force being at the right-hand focus.

Fig. 11.2.

If $1 + (2EM^2/m\alpha^2) = 0$, i.e. $E = -m\alpha^2/2M^2$, a glance at
formula (11.7) shows that the eccentricity vanishes, and the trajectory is
a circle. At a given $M$, such an energy value in the given force
field is the minimum possible one (an imaginary value of $e$
corresponds to smaller values of $E$).

When a particle moves in a restricted region of space (the particle
does not travel to infinity), the motion is called finite. In infinite
motion, a particle travels to infinity. Motion in an ellipse is finite
(recall that in this case $E < 0$), and motion in a hyperbola or
parabola is infinite ($E \geq 0$).
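
The classification of trajectories by the sign of the energy can be checked numerically. The following minimal Python sketch evaluates the focal parameter (11.6) and the eccentricity (11.7) for a field $U = \alpha/r$ and names the corresponding conic. The function names and the tolerance `tol` are assumptions introduced here purely for illustration.

```python
import math


def conic_parameters(E, M, m, alpha):
    """Return (p, e) from Eqs. (11.6) and (11.7) for the field U = alpha / r."""
    p = M**2 / (m * abs(alpha))
    radicand = 1.0 + 2.0 * E * M**2 / (m * alpha**2)
    if radicand < 0:
        # Energies below -m*alpha**2 / (2*M**2) would give an imaginary e.
        raise ValueError("E is below the minimum possible value for this M")
    return p, math.sqrt(radicand)


def trajectory_type(E, M, m, alpha, tol=1e-12):
    """Name the conic that corresponds to the eccentricity e."""
    _, e = conic_parameters(E, M, m, alpha)
    if e < tol:
        return "circle"
    if e < 1.0 - tol:
        return "ellipse"
    if abs(e - 1.0) <= tol:
        return "parabola"
    return "hyperbola"
```

The sketch does not check that the chosen energy is physically admissible for repulsion ($E > 0$ when $\alpha > 0$); that restriction has to be respected by the caller.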

12. Two-Body Problem 

Consider a closed system formed of two interacting particles of
masses $m_1$ and $m_2$. The potential energy of the system depends on
the distance $r$ between the particles. This distance can be treated
as the magnitude of the vector $\mathbf{r}$ drawn from one particle (say $m_2$) to
the other one ($m_1$): $U = U(r)$ (Fig. 12.1).

The system has six degrees of freedom. We shall take as the
generalized coordinates the three Cartesian coordinates of the centre of
mass $C$ of the system (which are equivalent to the position vector
$\mathbf{R}$ of the point $C$) and the three projections of the vector
$\mathbf{r}$ onto the coordinate axes (which are equivalent to the vector
$\mathbf{r}$). The vector $\mathbf{r}$ can be represented as

$$\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2 \tag{12.1}$$

where $\mathbf{r}_1$ and $\mathbf{r}_2$ are the position vectors of the particles relative to the
centre of mass. According to the definition of the centre of mass

$$m_1\mathbf{r}_1 + m_2\mathbf{r}_2 = 0 \tag{12.2}$$


Solving Eqs. (12.1) and (12.2) simultaneously, we find that

$$\mathbf{r}_1 = \frac{m_2}{m_1 + m_2}\,\mathbf{r}, \qquad \mathbf{r}_2 = -\frac{m_1}{m_1 + m_2}\,\mathbf{r} \tag{12.3}$$


The position of the particles relative to the origin of coordinates
is determined by the vectors $\mathbf{R} + \mathbf{r}_1$ and $\mathbf{R} + \mathbf{r}_2$.

Let us write the Lagrangian of the system:

$$L = \frac{m_1}{2}(\dot{\mathbf{R}} + \dot{\mathbf{r}}_1)^2 + \frac{m_2}{2}(\dot{\mathbf{R}} + \dot{\mathbf{r}}_2)^2 - U(r)$$

Squaring and taking into account that by (12.2) we have
$m_1\dot{\mathbf{r}}_1 + m_2\dot{\mathbf{r}}_2 = 0$, we obtain

$$L = \frac{m_1 + m_2}{2}\dot{\mathbf{R}}^2 + \frac{m_1}{2}\dot{\mathbf{r}}_1^2 + \frac{m_2}{2}\dot{\mathbf{r}}_2^2 - U(r)$$

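Finally, expressing $\mathbf{r}_1$ and $\mathbf{r}_2$ in terms of $\mathbf{r}$ in accordance with (12.3), we
arrive at an expression for $L$ in the “coordinates” $\mathbf{R}$ and $\mathbf{r}$ which we
have adopted:¹

$$L = \frac{m_1 + m_2}{2}\dot{\mathbf{R}}^2 + \frac{\mu}{2}\dot{\mathbf{r}}^2 - U(r) \tag{12.4}$$

where

$$\mu = \frac{m_1 m_2}{m_1 + m_2} \tag{12.5}$$

is a quantity known as the reduced mass of the system.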

Function (12.4) breaks up into two independent terms:

$$L = L(\dot{\mathbf{R}}) + L(\mathbf{r}, \dot{\mathbf{r}})$$

The first of them describes the behaviour of the centre of mass of the 
system, and the second, the motion of the particles relative to the 
centre of mass. 

A glance at (12.4) shows that the coordinate $\mathbf{R}$ is cyclic (it is not
contained explicitly in $L$; see Sec. 11). Consequently,

$$\mathbf{p}_R = (m_1 + m_2)\dot{\mathbf{R}} = \text{const}$$

(the momentum of the system is conserved; this could have been
foreseen because the system is closed). The centre of mass of the system
moves rectilinearly and uniformly (or is at rest).

¹ The projections of the vectors $\mathbf{R}$ and $\mathbf{r}$ onto the coordinate axes, and not the
vectors themselves, are the actual coordinates.

The motion of particles relative to the centre of mass is described
by the function

$$L = \frac{\mu}{2}\dot{\mathbf{r}}^2 - U(r) \tag{12.6}$$

It can be considered as the Lagrangian of a particle of mass $\mu$ moving
in a central field with a stationary centre. The position of a particle
relative to the force centre is determined by the position vector $\mathbf{r}$.
Therefore, the problem of the motion of a system consisting of two
interacting particles (the two-body problem) has been reduced to
the problem of the motion of one particle in a central force field. We
have treated this problem in Sec. 11. We established that when
$U = \alpha/r$, the trajectory of a particle will be a conic. Consequently,
as the particles move, the tip of the position vector
$\mathbf{r} = \mathbf{r}_1 - \mathbf{r}_2$ slides along a curve that is a conic. By (12.3),
the position vectors $\mathbf{r}_1$ and $\mathbf{r}_2$ are proportional to the position
vector $\mathbf{r}$.¹ Therefore, each of these vectors also traces a conic.
Depending on the nature of the interaction (attraction or repulsion) and
the magnitude of the total energy of the system, the trajectory of each
of the particles will be either an ellipse, a parabola, or a hyperbola.
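
The reduction to a single particle of reduced mass can be illustrated with a short numerical check. The sketch below, written in Python under the assumption that vectors are plain 3-tuples, splits a relative vector according to (12.3) and compares the kinetic energy of the two particles in the centre-of-mass frame with the term $(\mu/2)\dot{\mathbf{r}}^2$ of (12.6). All function names are hypothetical.

```python
def reduced_mass(m1, m2):
    """Reduced mass, Eq. (12.5)."""
    return m1 * m2 / (m1 + m2)


def split_relative_vector(m1, m2, r):
    """Return (r1, r2) from the relative vector r according to Eq. (12.3)."""
    total = m1 + m2
    r1 = tuple(m2 / total * x for x in r)
    r2 = tuple(-m1 / total * x for x in r)
    return r1, r2


def relative_kinetic_energy(m1, m2, r_dot):
    """Kinetic energy about the centre of mass, computed in two ways.

    Relation (12.3) is linear, so it also holds for the velocities.
    Returns (sum over both particles, reduced-mass expression).
    """
    v1, v2 = split_relative_vector(m1, m2, r_dot)
    t_particles = (0.5 * m1 * sum(x * x for x in v1)
                   + 0.5 * m2 * sum(x * x for x in v2))
    t_reduced = 0.5 * reduced_mass(m1, m2) * sum(x * x for x in r_dot)
    return t_particles, t_reduced
```

The two returned energies should agree to rounding error for any masses and any relative velocity, which is the content of the passage from the Lagrangian written in terms of $\dot{\mathbf{r}}_1$ and $\dot{\mathbf{r}}_2$ to (12.4).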



Fig. 12.2. 


Assume that an auxiliary imaginary particle $\mu$ moves in an
ellipse having the equation

$$r = \frac{p}{1 - e\cos\varphi}$$

According to (12.3), the vector $\mathbf{r}_1$ at any instant has the same
direction as $\mathbf{r}$ (i.e. $\varphi_1 = \varphi$), while its magnitude is $m_2/(m_1 + m_2)$
times that of $\mathbf{r}$. Hence, the particle $m_1$ moves in the ellipse

$$r_1 = \frac{p_1}{1 - e\cos\varphi_1} \tag{12.7}$$

where $p_1 = p\,m_2/(m_1 + m_2)$.

The vector $\mathbf{r}_2$ at each instant is directed oppositely to the vector $\mathbf{r}$
[see (12.3)]. Consequently, when the vector $\mathbf{r}$ is oriented at the
angle $\varphi$, the vector $\mathbf{r}_2$ is oriented at the angle $\varphi_2 = \varphi + \pi$. The
magnitude of the vector $\mathbf{r}_2$ is $m_1/(m_1 + m_2)$ times that of $\mathbf{r}$. Hence,
having in view that $\cos(\varphi_2 - \pi) = -\cos\varphi_2$, the equation of the ellipse
along which the particle $m_2$ travels must be written as

$$r_2 = \frac{p_2}{1 + e\cos\varphi_2} \tag{12.8}$$

where $p_2 = p\,m_1/(m_1 + m_2)$.


¹ For this reason, we must assume that the imaginary centre of force under
whose action the particle $\mu$ moves is situated at the point from which the vectors
$\mathbf{r}_1$ and $\mathbf{r}_2$ emerge, i.e. at the centre of mass of the system $C$.

In the case corresponding to Eq. (12.7), the origin of coordinates
(i.e. the centre of mass $C$) is at the left-hand focus of the ellipse [see
formula (IV.11)]. In the case corresponding to Eq. (12.8), the origin
of coordinates (the point $C$) is at the right-hand focus of the ellipse
[see formula (IV.12)]. Consequently, the trajectories of the particles
are as shown in Fig. 12.2.¹



Fig. 12.3. 



We invite our reader to verify that in motion along hyperbolas
the trajectories of the particles will appear as shown in Fig. 12.3a
(for mutual attraction of the particles) and in Fig. 12.3b (for
repulsion).

13. Elastic Collisions of Particles 

A collision is defined to be a process in which particles
interacting with each other and arriving from infinity (i.e. from
a distance such that their interaction may be disregarded) approach
each other, and then either recede again to infinity, or remain at
a finite distance from each other. In the first case, the collision is
called the scattering of the particles, and in the second, their capture.
The latter can obviously be observed only if the interaction of the 
particles has the nature of attraction. 

When we speak about collisions of particles, we do not at all
assume that the particles come into contact, as is the case, for
instance, in the collision of two spheres. What we have in mind is only
the fact that owing to their interaction, the particles change the
direction or nature of their motion.

¹ Figures 12.2 and 12.3 are for $m_1/m_2 = 2/3$; in Fig. 12.2, $e = 0.8$, and in
Fig. 12.3, $e = 1.5$.

Consider the elastic collision of two particles repelling each other. 
A collision is defined as elastic if it is not attended by a change in the 
internal energy of the particles. Consequently, in elastic collisions, 
the mechanical energy of the system of colliding particles remains 
constant. 

It is simplest to consider a collision process in a reference frame
associated with the centre of mass of the particles (it is known as
a c-frame). In practice, however, collisions are observed in a
reference frame relative to which the centre of mass of the particles
moves with the velocity $\mathbf{v}_c$. This reference frame is called a
laboratory one or, more briefly, an l-frame. In a laboratory frame, one of
the particles is usually at rest before a collision.

The following relation obviously holds between the velocity of
the $i$-th particle in an l-frame (we shall designate it by the symbol $\mathbf{v}_i$)
and the velocity of the same particle in a c-frame [we shall designate
this velocity by $\mathbf{v}_i^{(c)}$]:

$$\mathbf{v}_i = \mathbf{v}_c + \mathbf{v}_i^{(c)} \tag{13.1}$$

It follows from the definition of the centre of mass that

$$\mathbf{v}_c = \frac{m_1\mathbf{v}_1 + m_2\mathbf{v}_2}{m_1 + m_2}$$

In the following, we shall treat only the case when in an l-frame the
second particle is at rest before a collision. Therefore, designating
the velocity of the first particle before the collision by the
symbol $\mathbf{v}_{10}$, we have

$$\mathbf{v}_c = \frac{m_1}{m_1 + m_2}\,\mathbf{v}_{10} \tag{13.2}$$


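Using this value of $\mathbf{v}_c$ in formula (13.1), we get the following
expressions for the velocities of the particles in a c-frame before the
collision:

$$\mathbf{v}_{10}^{(c)} = \frac{m_2}{m_1 + m_2}\,\mathbf{v}_{10}, \qquad \mathbf{v}_{20}^{(c)} = -\frac{m_1}{m_1 + m_2}\,\mathbf{v}_{10} \tag{13.3}$$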


Multiplying the first of these velocities by $m_1$ and the second by $m_2$,
we find the momenta of the particles before the collision in the
c-frame:

$$\mathbf{p}_{10}^{(c)} = \mu\mathbf{v}_{10}, \qquad \mathbf{p}_{20}^{(c)} = -\mu\mathbf{v}_{10}$$

Here $\mu = m_1m_2/(m_1 + m_2)$ is the reduced mass of the particles.

As expected, the total momentum of the particles in the c-frame
before they collide is zero. It follows from the law of momentum
conservation that after the collision too, the momenta of the particles
in the c-frame can differ only in their sign: $\mathbf{p}_1^{(c)} = -\mathbf{p}_2^{(c)}$.

The total kinetic energy of the particles cannot change as a result
of an elastic collision (we assume that both before and after
colliding the particles are so far from each other that their mutual
potential energy is negligibly small). We can therefore write the
relation

$$\frac{p_{10}^2}{2m_1} + \frac{p_{20}^2}{2m_2} = \frac{p_1^2}{2m_1} + \frac{p_2^2}{2m_2}$$

[we have omitted the superscript “(c)” on the symbols of the
momenta]. In combination with the condition that
$|\mathbf{p}_{10}^{(c)}| = |\mathbf{p}_{20}^{(c)}|$ and $|\mathbf{p}_1^{(c)}| = |\mathbf{p}_2^{(c)}|$,
the relation we have written indicates that as a result of the
collision the momenta (and, consequently, the velocities) of the
particles only turn in the c-frame through a certain angle $\chi$,
remaining constant in magnitude. Let $\mathbf{e}_1$ stand for the unit vector of
the first particle’s velocity in the c-frame after the collision. In
accordance with formulas (13.3), we can therefore write the following
expressions for the velocities of the particles after the collision:

$$\mathbf{v}_1^{(c)} = \frac{m_2}{m_1 + m_2}\,v_{10}\mathbf{e}_1, \qquad \mathbf{v}_2^{(c)} = -\frac{m_1}{m_1 + m_2}\,v_{10}\mathbf{e}_1$$


To obtain the velocities of the particles after the collision in the
l-frame, we substitute expression (13.2) for $\mathbf{v}_c$ and the above
expressions for $\mathbf{v}_i^{(c)}$ in formula (13.1). The result is

$$\mathbf{v}_1 = \frac{m_1}{m_1 + m_2}\,\mathbf{v}_{10} + \frac{m_2}{m_1 + m_2}\,v_{10}\mathbf{e}_1$$

$$\mathbf{v}_2 = \frac{m_1}{m_1 + m_2}\,\mathbf{v}_{10} - \frac{m_1}{m_1 + m_2}\,v_{10}\mathbf{e}_1$$

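We obtain the following expressions for the momenta of the
particles in the l-frame after the collision:

$$\mathbf{p}_1 = \frac{m_1}{m_1 + m_2}\,\mathbf{p}_{10} + \frac{m_2}{m_1 + m_2}\,p_{10}\mathbf{e}_1, \qquad \mathbf{p}_2 = \frac{m_2}{m_1 + m_2}\,\mathbf{p}_{10} - \frac{m_2}{m_1 + m_2}\,p_{10}\mathbf{e}_1 \tag{13.4}$$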
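
As a consistency check on formulas (13.4), the following Python sketch evaluates the laboratory-frame momenta for a given c-frame deflection angle $\chi$ and allows one to confirm that momentum and kinetic energy are conserved. It is a two-dimensional illustration that assumes $\mathbf{p}_{10}$ is directed along the x-axis; the function name is hypothetical.

```python
import math


def lab_momenta_after(m1, m2, p10, chi):
    """Momenta (p1, p2) in the l-frame after an elastic collision, Eq. (13.4).

    Assumption: p10 is the magnitude of the initial momentum, directed
    along the x-axis; e1 = (cos chi, sin chi) is the unit vector of the
    first particle's c-frame velocity after the collision.
    """
    total = m1 + m2
    e1 = (math.cos(chi), math.sin(chi))
    p1 = (m1 / total * p10 + m2 / total * p10 * e1[0],
          m2 / total * p10 * e1[1])
    p2 = (m2 / total * p10 - m2 / total * p10 * e1[0],
          -m2 / total * p10 * e1[1])
    return p1, p2
```

For any $\chi$, the components of `p1` and `p2` should add up to `(p10, 0)`, and the sum $p_1^2/2m_1 + p_2^2/2m_2$ should equal $p_{10}^2/2m_1$.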

The following geometrical construction illustrates the relations
obtained. Let us depict the vector $\mathbf{p}_{10}$ by the segment $AD$
(Fig. 13.1) and let the point $O$ on it divide the length of the vector in
the ratio $m_1 : m_2$. We draw a circle passing through the tip of the
vector $\mathbf{p}_{10}$ with the point $O$ as its centre. The radius of this circle
is $p_{10}m_2/(m_1 + m_2)$. If $m_1 < m_2$, the point $A$ will be inside the
circle (Fig. 13.1a); if $m_1 > m_2$, the point $A$ will be outside the
circle (Fig. 13.1b); if $m_1 = m_2$, the point $A$ will be on the circle
(Fig. 13.1c). We lay off from the point $O$ at the angle $\chi$ relative to
$\mathbf{p}_{10}$ the unit vector $\mathbf{e}_1$ of the direction in which we assume the
first particle to be flying relative to the c-frame. Hence, the segment
$OB$ will depict the vector $\mathbf{e}_1p_{10}m_2/(m_1 + m_2)$, and in accordance
with formulas (13.4) the segment $AB$ will depict the vector $\mathbf{p}_1$, and
the segment $BD$, the vector $\mathbf{p}_2$.

The angle $\theta_1$ between the vectors $\mathbf{p}_1$ and $\mathbf{p}_{10}$ is called the
scattering angle. It characterizes the deviation of the first particle
observed in the l-frame. The angle $\theta_2$ between the vectors $\mathbf{p}_2$ and
$\mathbf{p}_{10}$ is called the recoil angle. The sum $\theta_1 + \theta_2$ is called the
divergence angle of the particles after the collision. The angles
$\theta_1$ and $\theta_2$ can be expressed in terms of $\chi$, the deflection angle of
the first particle in the c-frame. Taking into account that the length
of the segment $OB$ is $p_{10}m_2/(m_1 + m_2)$, we can write

$$\tan\theta_1 = \frac{[p_{10}m_2/(m_1 + m_2)]\sin\chi}{p_{10}m_1/(m_1 + m_2) + [p_{10}m_2/(m_1 + m_2)]\cos\chi}$$

or

$$\tan\theta_1 = \frac{m_2\sin\chi}{m_1 + m_2\cos\chi} \tag{13.5}$$

Fig. 13.1.


From the isosceles triangle $OBD$, we get the relation

$$\theta_2 = \frac{\pi - \chi}{2} \tag{13.6}$$

Inspection of Fig. 13.1a shows that a lighter particle can be
scattered by a heavier one ($m_1 < m_2$) in any direction (the point $B$ can
be at any place on the circle). The angle of divergence of the
particles in this case is always greater than $\pi/2$.

When $m_1 > m_2$ (Fig. 13.1b), the scattering angle cannot exceed
a certain extreme value $\theta_{1\max}$ (the point $B'$ in the figure
corresponds to it). The sine of this angle equals the ratio of the segments
$OB'$ and $OA$, i.e.

$$\sin\theta_{1\max} = \frac{m_2}{m_1}$$

When $m_1 > m_2$, the divergence angle of the particles is always less
than $\pi/2$.

If the masses of the particles are the same ($m_1 = m_2$), the
particles after colliding fly apart at right angles to each other
($\theta_1 + \theta_2 = \pi/2$; Fig. 13.1c).

In a head-on collision, the particles fly apart at the angle $\theta_1 + \theta_2$
equal either to $\pi$ (when $m_1 < m_2$; Fig. 13.1a) or to zero (when
$m_1 > m_2$; Fig. 13.1b). The angle $\chi$ in a head-on collision is $\pi$. When the
masses of the particles are the same ($m_1 = m_2$), the momentum $\mathbf{p}_1$ is
zero, and $\mathbf{p}_2 = \mathbf{p}_{10}$ (see Fig. 13.1c; in this case, the point $B$ coincides
with the point $A$). Consequently, particles of the same mass exchange
momenta in a head-on collision. This result can also be obtained
quite easily from formulas (13.4).
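
The angular relations (13.5) and (13.6) are easy to explore numerically. The Python sketch below is an illustration only; the function name is an assumption, and `math.atan2` is used instead of a plain arctangent so that scattering angles larger than $\pi/2$ (possible when $m_1 < m_2$) come out in the correct quadrant.

```python
import math


def lab_angles(m1, m2, chi):
    """Scattering angle theta1, Eq. (13.5), and recoil angle theta2, Eq. (13.6).

    chi is the deflection angle of the first particle in the c-frame,
    taken in the open interval (0, pi). For m1 == m2 and chi == pi the
    momentum p1 vanishes and theta1 has no geometric meaning.
    """
    theta1 = math.atan2(m2 * math.sin(chi), m1 + m2 * math.cos(chi))
    theta2 = (math.pi - chi) / 2.0
    return theta1, theta2
```

For equal masses the sum of the two returned angles should equal $\pi/2$ for every $\chi$, and for $m_1 > m_2$ a scan over $\chi$ should never produce a scattering angle above $\sin^{-1}(m_2/m_1)$.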

The results we have obtained are a corollary of the laws of energy
and momentum conservation and do not depend on the nature of
particle interaction. To determine the angle $\chi$ through which a particle
is deflected, we must know the law of interaction of the particles and
their mutual arrangement in colliding. The following section is devoted
to a treatment of this matter.


14. Particle Scattering 

We showed in Sec. 12 that the problem of the motion of two
interacting particles reduces to the problem of the motion of a particle
of mass $\mu$ ($\mu$ is the reduced mass) in a central force field, the distance
from this particle to the centre of force being equal to the distance
between the particles in question. After finding the trajectory of the
imaginary particle of mass $\mu$, it is a simple matter to find the
trajectories of both particles.

We shall use this procedure for studying the scattering of the
particle $m_1$ by the particle $m_2$ which is initially stationary
in an l-frame. Let us pass over to a c-frame and consider the
particle $\mu$ moving in a force field whose centre coincides with the
centre of mass $C$ of the system. We shall consider the field to be so
weak at large distances from the centre that the motion of a particle
at these distances may be considered rectilinear.

Let us introduce the impact parameter $b$ equal to the distance
from the centre of force at which a particle would fly past it if the
field did not act on it (Fig. 14.1). It is evident that the angle of
deflection of a particle is a function of the impact parameter:
$\chi = \chi(b)$, and, generally speaking, $\chi$ should grow with diminishing $b$.
Inversion of this function yields

$$b = b(\chi) \tag{14.1}$$

Assume that a beam of identical particles flying far from the
centre of force $C$ in the same direction and with the same velocity $v_0$
is incident on the centre. The beam can be characterized by the
density $j$ of the particle flux, by which is meant the number of particles
flying per second through unit area at right angles to the beam. We
shall assume that the beam of particles is homogeneous, i.e. that
far from the scattering centre the flux density is the same at all points
of the beam’s cross section.

Fig. 14.1.

The beam particles are deflected through different angles $\chi$ depending
on the impact parameter of a particle approaching the centre. Particles
whose impact parameter ranges from $b$ to $b + db$ will be scattered
within the limits of angles from $\chi$ to $\chi + d\chi$. Let us denote the flux
of such particles (i.e. the number of particles scattered at angles from
$\chi$ to $\chi + d\chi$ in unit time) by $dN_\chi$. The ratio of $dN_\chi$ to the
flux density $j$

$$d\sigma = \frac{dN_\chi}{j} \tag{14.2}$$

is known as the differential effective cross section of scattering. One
reason why this name was introduced is that $d\sigma$,
as follows from (14.2), has the dimension of area. It is simple to see
that $d\sigma$ determines the relative number (fraction) of particles
scattered within a given range of angles.

When a beam has a homogeneous cross section, the flux of particles
whose impact parameter ranges from $b$ to $b + db$ is $j\,2\pi b\,db$ (the
flux equals the flux density times the area). This flux is scattered
at angles from $\chi$ to $\chi + d\chi$. Hence, $dN_\chi = j\,2\pi b\,db$. Using this
value in formula (14.2), we obtain

$$d\sigma = \frac{j\,2\pi b\,db}{j} = 2\pi b\,db$$

or, passing over from the variable $b$ to $\chi$ [see (14.1)],

$$d\sigma = 2\pi b(\chi)\left|\frac{db}{d\chi}\right|d\chi \tag{14.3}$$

(we have taken the absolute value of $db/d\chi$ because $db/d\chi$ is less
than zero).

The quantity $dN_\chi$ in formula (14.2) can be interpreted as the flux
of particles flying within the solid angle $d\Omega = 2\pi\sin\chi\,d\chi$ (this is
the value of the solid angle confined between cones with apex angles
of $\chi$ and $\chi + d\chi$). Substituting $d\Omega/\sin\chi$ for $2\pi\,d\chi$ in (14.3), we can
reduce the formula for the differential effective cross section of
scattering to the form

$$d\sigma = \frac{b(\chi)}{\sin\chi}\left|\frac{db}{d\chi}\right|d\Omega \tag{14.4}$$


Formula (14.3), like (14.4), is the most general: it determines the
differential effective scattering cross section for any central
scattering field. The quantities $b(\chi)$ and $db/d\chi$ are determined by the
nature of the force field, i.e. by that of particle interaction.
Consequently, $d\sigma$ is determined by the kind of scattering field and is the
most important characteristic of a scattering process. We can
obtain information on the nature of a force field by experimentally
studying $d\sigma$.

Up to now, we have dealt with the scattering of a beam of
particles on one scattering centre. In practice, however, scattering occurs
on a collection of identical scattering centres. In this connection,
we shall note the following circumstance. The deflection angles $\chi$
are appreciable only for particles that approach the scattering
centre sufficiently closely (for which the impact parameter is small).
Therefore, if there are $n$ identical, non-overlapping, and sufficiently
sparse scattering centres in the path of the particles, they will scatter
the particles independently of one another, and the flux of particles
deflected within the range of angles from $\chi$ to $\chi + d\chi$ will be $n$ times
greater than when there is only one centre. Hence, with $n$ centres,
we have

$$dN_\chi = nj\,d\sigma \tag{14.5}$$


Let us now go over from a particle of mass $\mu$ deflected by a
stationary force centre at the point $C$ to the real particles $m_1$ and $m_2$. The
trajectories of these particles are geometrically similar to that of
the particle $\mu$. Indeed, according to (12.3), the position vector of
the first particle emerging from the point $C$ is $m_2/(m_1 + m_2)$ times
the position vector of the particle $\mu$ emerging from the
same point. A similar relation also holds for the second particle.
This signifies that the particle $m_1$ is deflected through the same angle $\chi$
as the particle $\mu$ in a c-frame. The impact parameter must be taken
equal to the distance at which the first particle would fly past the
second one if the particles did not interact (the position vector $\mathbf{r}$ of
the particle $\mu$ emerging from the force centre $C$ coincides with $\mathbf{r}_1 - \mathbf{r}_2$).

Hence, formula (14.3) also holds (in a c-frame) for a beam of
particles $m_1$ scattered by the particle $m_2$. To go over from the c-frame
to the laboratory reference frame in which scattering is being
observed, we must pass from the variable $\chi$ to the variable $\theta_1$ in
formula (14.3). We use formula (13.5) for this transition. The
resulting formula is very cumbersome in the general case.

In the particular case when $m_1 \ll m_2$ (the particles being
scattered are much lighter than the ones scattering them), $\theta_1 \approx \chi$ [see
formula (13.5)] so that formulas (14.3) and (14.4) can be written in
the l-frame:

$$d\sigma = 2\pi b(\theta_1)\left|\frac{db}{d\theta_1}\right|d\theta_1, \qquad d\sigma = \frac{b(\theta_1)}{\sin\theta_1}\left|\frac{db}{d\theta_1}\right|d\Omega \tag{14.6}$$

We must note that in this case $\mu \approx m_1$ and $\mathbf{r} \approx \mathbf{r}_1$ (the trajectory of
the particle $m_1$ virtually coincides with that of the particle $\mu$).

Consider a Coulomb scattering field, i.e. a field of the kind
$U = \alpha/r$, assuming that $m_1 \ll m_2$. The energy of the particle being
deflected can be represented by the expression $E = m_1v_0^2/2$, where $v_0$
is the initial (and final) velocity of the particle $m_1$. The angular
momentum of the particle relative to the scattering centre
(coinciding with the particle $m_2$) is $M = m_1v_0b$ (see Fig. 14.1). Using
these values of $E$ and $M$ in formula (11.5), we arrive at the relation

$$\varphi - \varphi_0 = \cos^{-1}\frac{1 + (m_1v_0^2b^2/\alpha)(1/r)}{\sqrt{1 + (m_1v_0^2b/\alpha)^2}} \tag{14.7}$$


A glance at Fig. 14.1 shows that when $r = \infty$ there are two values
of $\varphi$: zero and $2\varphi_0$. In the first case, the left-hand side of
formula (14.7) becomes $-\varphi_0$, and in the second, $+\varphi_0$. Therefore, assuming
in (14.7) that $r = \infty$, we can write

$$\varphi_0 = \cos^{-1}\frac{1}{\sqrt{1 + (m_1v_0^2b/\alpha)^2}}$$

whence

$$\cos^2\varphi_0 = \frac{1}{1 + (m_1v_0^2b/\alpha)^2} \tag{14.8}$$

Further, it follows from Fig. 14.1 that $\chi = \pi - 2\varphi_0$, i.e.
$\varphi_0 = \pi/2 - \chi/2 = \pi/2 - \theta_1/2$ (in the case being considered it is
possible to assume that $\theta_1 = \chi$). Introducing this value of $\varphi_0$ into
formula (14.8), we obtain

$$\sin^2\frac{\theta_1}{2} = \frac{1}{1 + (m_1v_0^2b/\alpha)^2}$$


Solving this relation for $b$, we arrive after simple transformations at
the expression

$$b = b(\theta_1) = \frac{\alpha}{m_1v_0^2}\cot\frac{\theta_1}{2} \tag{14.9}$$


Differentiation with respect to $\theta_1$ yields

$$\frac{db}{d\theta_1} = -\frac{\alpha}{2m_1v_0^2}\,\frac{1}{\sin^2(\theta_1/2)} \tag{14.10}$$


Finally, using expressions (14.9) and (14.10) in (14.6), we get
formulas for the differential effective cross section of scattering of the
particles of mass $m_1$ in the Coulomb field set up by the particle of mass
$m_2$ (here $m_2 \gg m_1$):

$$d\sigma = \pi\left(\frac{\alpha}{m_1v_0^2}\right)^2\frac{\cos(\theta_1/2)}{\sin^3(\theta_1/2)}\,d\theta_1 \tag{14.11}$$

$$d\sigma = \left(\frac{\alpha}{2m_1v_0^2}\right)^2\frac{d\Omega}{\sin^4(\theta_1/2)} \tag{14.12}$$
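
The passage from the general formula (14.6) to (14.12) can be verified numerically. The sketch below, in Python, evaluates $d\sigma/d\Omega$ in two ways: from (14.6) with $b(\theta_1)$ taken from (14.9) and its derivative estimated by a central finite difference, and directly from (14.12). The function names and the step `h` are assumptions made for this illustration, and $\alpha > 0$ (repulsion) is assumed so that $b$ in (14.9) is positive.

```python
import math


def impact_parameter(theta1, alpha, m1, v0):
    """Impact parameter b(theta1) for a Coulomb field, Eq. (14.9)."""
    return alpha / (m1 * v0**2) / math.tan(theta1 / 2.0)


def dsigma_domega_general(theta1, alpha, m1, v0, h=1e-6):
    """d(sigma)/d(Omega) from Eq. (14.6), with db/dtheta1 estimated numerically."""
    b = impact_parameter(theta1, alpha, m1, v0)
    db_dtheta = (impact_parameter(theta1 + h, alpha, m1, v0)
                 - impact_parameter(theta1 - h, alpha, m1, v0)) / (2.0 * h)
    return b / math.sin(theta1) * abs(db_dtheta)


def dsigma_domega_rutherford(theta1, alpha, m1, v0):
    """d(sigma)/d(Omega) from the closed form, Eq. (14.12)."""
    return (alpha / (2.0 * m1 * v0**2))**2 / math.sin(theta1 / 2.0)**4
```

For scattering angles well away from 0 and $\pi$ the two functions should agree to within the accuracy of the finite difference.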


We have obtained Rutherford’s formula for the scattering of alpha
particles on heavy nuclei known from the general course of physics.
This can be verified by substituting $2Ze^2$ for $\alpha$ and multiplying
formulas (14.11) and (14.12) by the flux density $j$ of alpha particles
and the number of atoms $n$ of the scattering substance per unit
cross-sectional area of the alpha particle beam. The expression $nj\,d\sigma$ will
be obtained on the left-hand side of the formula, which gives $dN_{\theta_1}$,
the flux of alpha particles scattered in the range of angles from $\theta_1$ to
$\theta_1 + d\theta_1$, or $dN_\Omega$, the flux of alpha particles scattered in the solid
angle $d\Omega$ [see formula (14.5)].

We must note that the expressions we have found for $d\sigma$ do not
depend on the sign of $\alpha$ so that the result obtained holds not only for
the Coulomb repulsion of the particles $m_1$ and $m_2$, but also for
their Coulomb attraction.


15. Motion in Non-Inertial Reference Frames

The Lagrangian of one particle has the form

$$L = \frac{1}{2}mv^2 - U(\mathbf{r}) \tag{15.1}$$

only in inertial reference frames. Let us find the form of $L$ in an
arbitrary non-inertial reference frame. Figure 15.1 depicts the
inertial reference frame $K$ and the frame $K'$ whose origin (the point $O'$)
moves in the frame $K$ at the velocity $\mathbf{v}_0(t)$. The frame $K'$, in addition,
rotates relative to the frame $K$ at the angular velocity $\boldsymbol{\omega}(t)$. Let
us express the function (15.1) in terms of the vector $\mathbf{r}'$ determining
the position of a particle in the frame $K'$, and in terms of the
velocity $\mathbf{v}'$ of the particle observed in the same frame.

We assume first that $\mathbf{v}_0(t) = 0$ and that the origins of both
reference frames coincide. Hence, the following relation would hold
between the velocities of the particle in both frames:

$$\mathbf{v} = \mathbf{v}' + [\boldsymbol{\omega}\mathbf{r}'] \tag{15.2}$$

(a particle that is stationary in the frame $K'$ would have a velocity
equal to $[\boldsymbol{\omega}\mathbf{r}']$¹ in the frame $K$). If $\mathbf{v}_0(t)$ is non-zero, on the other
hand, relation (15.2) becomes

$$\mathbf{v} = \mathbf{v}_0(t) + \mathbf{v}' + [\boldsymbol{\omega}\mathbf{r}'] \tag{15.3}$$

Let us introduce the expression we have obtained for $\mathbf{v}$ into
formula (15.1). This yields

$$L = \frac{m}{2}[\mathbf{v}_0(t)]^2 + \frac{m}{2}\mathbf{v}'^2 + \frac{m}{2}[\boldsymbol{\omega}\mathbf{r}']^2 + m\mathbf{v}_0(t)\mathbf{v}' + m\mathbf{v}_0(t)[\boldsymbol{\omega}\mathbf{r}'] + m\mathbf{v}'[\boldsymbol{\omega}\mathbf{r}'] - U \tag{15.4}$$

The first term in this formula is a preset function of time, which
can be represented as the total derivative with respect to $t$ of another
function. We established in Sec. 7 that the Lagrangian is
determined to within additive terms that are the total time
derivative of an arbitrary function of the generalized coordinates
and time. For this reason, the term $(m/2)[\mathbf{v}_0(t)]^2$ can be omitted.

Consider the fourth and fifth terms in formula (15.4). Factoring
out $m\mathbf{v}_0(t)$, these terms can be written in the form

$$m\mathbf{v}_0(t)\{\mathbf{v}' + [\boldsymbol{\omega}\mathbf{r}']\} = m\mathbf{v}_0(t)\left\{\frac{d'\mathbf{r}'}{dt} + \left[\frac{d\boldsymbol{\varphi}}{dt}, \mathbf{r}'\right]\right\} = m\mathbf{v}_0(t)\,\frac{1}{dt}\{d'\mathbf{r}' + [d\boldsymbol{\varphi}, \mathbf{r}']\} \tag{15.5}$$

Here $d'\mathbf{r}'$ is the increment of $\mathbf{r}'$ observed during the time $dt$ in the
frame $K'$ (we remind our reader that $\mathbf{v}'$ is the velocity of the particle
observed in the frame $K'$), and $d\boldsymbol{\varphi}$ is the angle through which the
frame $K'$ turns during the time $dt$.

¹ This expression is obtained from formula (VI.46) if we assume that $\mathbf{a} = \mathbf{r}'$
in it and divide the relation obtained by $dt$.

If a reference frame rotates relative to another one, the increment
of a vector $\mathbf{a}$ observed in the two frames will be different. This is easy
to understand by assuming that the vector does not change with
respect to the rotating frame, i.e. the increment of the vector in
this frame (we designate it by $d'\mathbf{a}$) is zero: $d'\mathbf{a} = 0$. Consequently,
the increment of the vector in the stationary frame (the frame $K$)
can be written as

$$d\mathbf{a} = [d\boldsymbol{\varphi}, \mathbf{a}]$$

[see formula (VI.46)]. If the increment of the vector $d'\mathbf{a}$ observed in
the rotating frame is non-zero, the increment observed in the
stationary frame will be

$$d\mathbf{a} = d'\mathbf{a} + [d\boldsymbol{\varphi}, \mathbf{a}] \tag{15.6}$$

[assuming that $\mathbf{a} = \mathbf{r}'$ and dividing by $dt$, we arrive at formula (15.2)].

By comparing (15.6) with the expression in braces on the
right-hand side of formula (15.5), we arrive at the conclusion that this
expression is the increment of the vector $\mathbf{r}'$ observed in the frame $K$,
i.e. $d\mathbf{r}'$. Hence, the sum of the fourth and fifth terms of formula (15.4)
can be written as

$$m\mathbf{v}_0(t)\frac{d\mathbf{r}'}{dt}$$

Let us transform this expression as follows:

$$m\mathbf{v}_0(t)\frac{d\mathbf{r}'}{dt} = \frac{d}{dt}\{m\mathbf{v}_0(t)\mathbf{r}'\} - m\mathbf{r}'\frac{d\mathbf{v}_0}{dt}$$

We may discard the first term as the total time derivative of a
function of the coordinates and time. In the second term, $d\mathbf{v}_0/dt$ is
$\mathbf{w}_0(t)$, the acceleration of the origin of coordinates of the frame $K'$
observed in the frame $K$.

We have thus arrived at the following expression for the
Lagrangian in the variables $\mathbf{r}'$ and $\mathbf{v}'$:

$$L' = \frac{1}{2}m\mathbf{v}'^2 + \frac{1}{2}m[\boldsymbol{\omega}\mathbf{r}']^2 - m\mathbf{r}'\mathbf{w}_0(t) + m\mathbf{v}'[\boldsymbol{\omega}\mathbf{r}'] - U(\mathbf{r}') \tag{15.7}$$

We have obtained the general form for the Lagrangian of a
particle in an arbitrary non-inertial reference frame. We must now
consider that the function $U$ is set in the variables $\mathbf{r}'$ [in formula (15.1) it
was set in the variables $\mathbf{r}$]. The transition from one set of variables
to another is accomplished by the formula

$$\mathbf{r} = \mathbf{r}_0(t) + \mathbf{r}' \tag{15.8}$$

where $\mathbf{r}_0(t)$ is the position vector of the origin of coordinates of the
frame $K'$ (see Fig. 15.1).

We must note that even if the time were not contained explicitly
in the function (15.1) (it could have been contained in the force
function $U$), the function (15.7) contains the time because $\mathbf{w}_0$ and $\boldsymbol{\omega}$
are, generally speaking, functions of $t$. The time also enters explicitly
the term $U(\mathbf{r}')$ as a result of the transition from $\mathbf{r}$ to $\mathbf{r}'$ accomplished
by formula (15.8).

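Before beginning to compile Lagrange’s equation, let us replace
the second term of expression (15.7) in accordance with formula (VI.6).
The result is

$$L' = \frac{1}{2}m\mathbf{v}'^2 + \frac{1}{2}m\omega^2r'^2 - \frac{1}{2}m(\boldsymbol{\omega}\mathbf{r}')^2 - m\mathbf{r}'\mathbf{w}_0(t) + m\mathbf{v}'[\boldsymbol{\omega}\mathbf{r}'] - U(\mathbf{r}') \tag{15.9}$$

Taking advantage of a cyclic transposition of the multipliers [see
formula (VI.3)], the next-to-last term equal to $m\mathbf{v}'[\boldsymbol{\omega}\mathbf{r}']$ could be
written in the form

$$m[\mathbf{v}'\boldsymbol{\omega}]\mathbf{r}' \tag{15.10}$$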


Lagrange’s equation in the frame K' is as follows: 

$$ \frac{d}{dt}\frac{\partial L'}{\partial\mathbf{v}'} = \frac{\partial L'}{\partial\mathbf{r}'} \tag{15.11} $$


[see formula (9.3) and the footnote on page 17]. A glance at expres- 
sion (15.9) shows that 

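$$ \frac{\partial L'}{\partial\mathbf{v}'} = m\mathbf{v}' + m[\boldsymbol{\omega}\mathbf{r}'] $$

whence

$$ \frac{d}{dt}\frac{\partial L'}{\partial\mathbf{v}'} = m\dot{\mathbf{v}}' + m[\dot{\boldsymbol{\omega}}\mathbf{r}'] + m[\boldsymbol{\omega}\dot{\mathbf{r}}'] $$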

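We remind our reader that from the very instant when we expressed
L in the variables $\mathbf{r}'$ and $\mathbf{v}'$, we have been "living" in the reference
frame K'. Consequently, by $\dot{\mathbf{v}}'$ we must understand the acceleration
$\mathbf{w}'$ of a particle observed in the frame K', and by $\dot{\mathbf{r}}'$, the velocity
$\mathbf{v}'$ of the particle in the same frame. Hence,

$$ \frac{d}{dt}\frac{\partial L'}{\partial\mathbf{v}'} = m\mathbf{w}' + m[\dot{\boldsymbol{\omega}}\mathbf{r}'] + m[\boldsymbol{\omega}\mathbf{v}'] \tag{15.12} $$

As regards $\dot{\boldsymbol{\omega}}$, it is the time derivative of the function $\boldsymbol{\omega}(t)$ that is
set in the frame K.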

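In calculating $\partial L'/\partial\mathbf{r}'$, we shall assume that the next-to-last term
in formula (15.9) is represented in the form (15.10). We thus obtain

$$ \frac{\partial L'}{\partial\mathbf{r}'} = m\omega^2\mathbf{r}' - m\boldsymbol{\omega}(\boldsymbol{\omega}\mathbf{r}') - m\mathbf{w}_0(t) + m[\mathbf{v}'\boldsymbol{\omega}] - \frac{\partial U}{\partial\mathbf{r}'} \tag{15.13} $$

The first two terms in this expression are the triple vector product
$m[\boldsymbol{\omega}, [\mathbf{r}'\boldsymbol{\omega}]]$ written according to formula (VI.5). Expression (15.13)
can therefore be written as

$$ \frac{\partial L'}{\partial\mathbf{r}'} = m[\boldsymbol{\omega}, [\mathbf{r}'\boldsymbol{\omega}]] - m\mathbf{w}_0(t) + m[\mathbf{v}'\boldsymbol{\omega}] - \frac{\partial U}{\partial\mathbf{r}'} \tag{15.14} $$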

Introducing expressions (15.12) and (15.14) into formula (15.11) 
and performing transformations, we arrive at an equation of motion 
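of a particle in the frame K':

$$ m\mathbf{w}' = -\frac{\partial U}{\partial\mathbf{r}'} - m\mathbf{w}_0(t) + m[\mathbf{r}'\dot{\boldsymbol{\omega}}] + m[\boldsymbol{\omega}, [\mathbf{r}'\boldsymbol{\omega}]] + 2m[\mathbf{v}'\boldsymbol{\omega}] \tag{15.15} $$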

We see that the acceleration of a particle in the frame K' is
determined, in addition to the force $-\partial U/\partial\mathbf{r}'$ due to the force field, by
a number of additional forces called, as is well known, forces of
inertia. The term $m[\boldsymbol{\omega}, [\mathbf{r}'\boldsymbol{\omega}]]$ gives the centrifugal force of inertia,
and the term $2m[\mathbf{v}'\boldsymbol{\omega}]$, the Coriolis force. The force $m[\mathbf{r}'\dot{\boldsymbol{\omega}}]$ is
associated with the non-uniformity of rotation; it has no special name.
If the frame K' has only translational motion relative to the frame
K (in this case $\boldsymbol{\omega} = 0$ and, consequently, $\dot{\boldsymbol{\omega}}$ also equals 0), the
equation of motion contains only one force of inertia equal to

$$ \mathbf{f}_{in} = -m\mathbf{w}_0(t) \tag{15.16} $$

It is remarkable that this force, like the force of gravity mg, is 
proportional to the mass of the particle. This circumstance underlies 
the general theory of relativity. 
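
The inertial terms of Eq. (15.15) are easy to evaluate numerically. The following is only a minimal sketch, not part of the original derivation: the function names `cross`, `scale` and `inertial_forces` and the sample values are our own assumptions, and vectors are represented as plain 3-tuples.

```python
def cross(a, b):
    """Vector product [ab] of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def scale(s, a):
    """Multiply the vector a by the scalar s."""
    return tuple(s * x for x in a)


def inertial_forces(m, r, v, omega,
                    omega_dot=(0.0, 0.0, 0.0), w0=(0.0, 0.0, 0.0)):
    """Inertial terms on the right-hand side of Eq. (15.15).

    r and v are the position and velocity of the particle in the frame K'.
    """
    return {
        "translational": scale(-m, w0),                          # -m w0(t)
        "non_uniform_rotation": scale(m, cross(r, omega_dot)),   # m [r' omega_dot]
        "centrifugal": scale(m, cross(omega, cross(r, omega))),  # m [omega, [r' omega]]
        "coriolis": scale(2.0 * m, cross(v, omega)),             # 2m [v' omega]
    }


# Illustrative values: unit mass on the x'-axis, rotation about the z-axis
forces = inertial_forces(m=1.0, r=(1.0, 0.0, 0.0), v=(1.0, 0.0, 0.0),
                         omega=(0.0, 0.0, 1.0))
```

With these sample values the centrifugal term points along $\mathbf{r}'$, away from the axis of rotation, while the Coriolis term is perpendicular to both $\mathbf{v}'$ and $\boldsymbol{\omega}$.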

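For a uniformly rotating system of coordinates having no
translational acceleration [$\mathbf{w}_0(t) = 0$, $\dot{\boldsymbol{\omega}} = 0$], the Lagrangian has the
form [see (15.7)]

$$ L' = \frac{mv'^2}{2} + \frac{m[\boldsymbol{\omega}\mathbf{r}']^2}{2} + m\mathbf{v}'[\boldsymbol{\omega}\mathbf{r}'] - U \tag{15.17} $$

Let us find the momentum, the angular momentum, and the energy
of a particle for this case. By formula (9.5)

$$ \mathbf{p}' = \frac{\partial L'}{\partial\mathbf{v}'} $$

Taking the derivative of function (15.17) with respect to $\mathbf{v}'$, we get

$$ \mathbf{p}' = m\mathbf{v}' + m[\boldsymbol{\omega}\mathbf{r}'] = m\{\mathbf{v}' + [\boldsymbol{\omega}\mathbf{r}']\} \tag{15.18} $$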

If the frame K' has neither a translational acceleration nor a
translational velocity ($\mathbf{v}_0 = 0$), inspection of (15.3) shows that the
expression in braces in (15.18) is the velocity $\mathbf{v}$ of a particle relative to
the inertial frame K. Hence, $\mathbf{p}'$ equals $m\mathbf{v}$, i.e. it coincides with the
momentum $\mathbf{p}$ of a particle in the inertial frame:

$$ \mathbf{p}' = \mathbf{p} \tag{15.19} $$

Further, if the origins of the frames K and K' coincide (see
Fig. 15.1), the position vectors $\mathbf{r}$ and $\mathbf{r}'$ also coincide. Hence, with
a view to (15.19), it follows that the angular momentum
$\mathbf{M}' = [\mathbf{r}'\mathbf{p}']$ in the frame K' coincides with the angular momentum
$\mathbf{M} = [\mathbf{r}\mathbf{p}]$ in the frame K:

$$ \mathbf{M}' = \mathbf{M} \tag{15.20} $$

By formula (5.1), the energy of a particle in the frame K' is
determined by the expression

$$ E' = \sum_i \frac{\partial L'}{\partial\dot{x}'_i}\,\dot{x}'_i - L' $$

where $x'_i$ are the Cartesian coordinates of the particle in the frame K'.
According to (4.17), $\partial L'/\partial\dot{x}'_i$ is $p'_i$, the projection of the momentum
$\mathbf{p}'$ of the particle onto the i-th coordinate axis, and $\dot{x}'_i$ is the
projection of the velocity $\mathbf{v}'$ of the particle onto the same axis.
Consequently, the expression for the energy can be written as

$$ E' = \mathbf{p}'\mathbf{v}' - L' \tag{15.21} $$

Substituting for $\mathbf{p}'$ its value from (15.18) and for L' expression (15.17),
we get the following formula:

$$ E' = \frac{mv'^2}{2} + U - \frac{m}{2}[\boldsymbol{\omega}\mathbf{r}']^2 \tag{15.22} $$

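Rotation of the reference frame manifested itself in the appearance
of the term

$$ -\frac{m}{2}[\boldsymbol{\omega}\mathbf{r}']^2 \tag{15.23} $$

not depending on the particle's velocity $\mathbf{v}'$ in the expression for the
energy. This additional "potential" energy is called centrifugal.

Let us substitute $\mathbf{v} - [\boldsymbol{\omega}\mathbf{r}']$ for $\mathbf{v}'$ in formula (15.22) [see (15.3);
we assume that $\mathbf{v}_0(t) = 0$]. The result is

$$ E' = \frac{mv^2}{2} + U - m\mathbf{v}[\boldsymbol{\omega}\mathbf{r}'] \tag{15.24} $$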

The first two terms give the energy E of a particle in the frame K.
If the origins of the frames K and K' coincide, $\mathbf{r}'$ may be replaced
by $\mathbf{r}$. The last term in (15.24) by means of cyclic transposition can
now be given the form

$$ m\mathbf{v}[\boldsymbol{\omega}\mathbf{r}] = \boldsymbol{\omega}[\mathbf{r}, m\mathbf{v}] = \boldsymbol{\omega}\mathbf{M} $$

Therefore, the following relation holds between the energies E and
E' of the particle in the frames K and K' respectively:

$$ E' = E - \boldsymbol{\omega}\mathbf{M} \tag{15.25} $$

We remind our reader that this formula has been obtained assuming
that the origins of both reference frames coincide. Consequently,
instead of $\mathbf{M}$ in formula (15.25), we can write $\mathbf{M}'$ [see (15.20)].

Summarizing, if the reference frame K' rotates uniformly relative
to the reference frame K, and the origins of both frames coincide,
the momentum and the angular momentum of a particle in both
frames coincide, while the energy of the particle in the frame K' is
less than that in the frame K by the magnitude of the scalar product
of the vectors $\boldsymbol{\omega}$ and $\mathbf{M}$.
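
Relation (15.25) can be checked numerically for arbitrary sample data. The sketch below is only an illustration under the assumptions of the text (uniform rotation, coincident origins, $\mathbf{v}_0 = 0$); the helper names and the numbers are ours.

```python
def cross(a, b):
    """Vector product [ab] of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def energies(m, r, v, omega, U):
    """Return (E, E_prime, omega . M) for a particle with position r and
    velocity v measured in the inertial frame K; U is the potential energy."""
    wr = cross(omega, r)                                   # [omega r]
    v_rel = tuple(vi - wi for vi, wi in zip(v, wr))        # v' = v - [omega r]
    E = 0.5 * m * dot(v, v) + U
    E_prime = 0.5 * m * dot(v_rel, v_rel) + U - 0.5 * m * dot(wr, wr)  # (15.22)
    M = cross(r, tuple(m * vi for vi in v))                # M = [r, mv]
    return E, E_prime, dot(omega, M)


# Arbitrary illustrative values
E, E_prime, omega_M = energies(m=2.0, r=(1.0, 2.0, 0.5), v=(0.3, -1.0, 0.7),
                               omega=(0.0, 0.4, 1.5), U=3.0)
difference = E_prime - (E - omega_M)   # should vanish up to rounding
```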


Chapter IV 


SMALL-AMPLITUDE OSCILLATIONS


16. Free Oscillations of a System Without Friction 

Consider a system with one degree of freedom in which friction 
forces are absent. The potential energy of such a system has the 
form U = U(q), where q is a generalized coordinate. The potential 
energy is known to be minimum in the position of stable equilibrium.
We shall measure q from this position. Let us expand the function
U(q) in powers of q in close proximity to the point q = 0.
Owing to the smallness of q, we shall limit ourselves to the first terms
of the expansion:

$$ U(q) = U(0) + U'(0)\,q + \frac{1}{2}U''(0)\,q^2 $$

The condition of equilibrium yields $U'(0) = 0$. Let us measure
the potential energy from the equilibrium position, i.e. assume
that $U(0) = 0$. Finally, let us introduce the symbol $U''(0) = \kappa$
(remember that the second derivative is positive at a point of a
minimum, hence, $\kappa > 0$). As a result, we arrive at the expression

$$ U(q) = \frac{\kappa q^2}{2} \tag{16.1} $$

We shall consider the constraints to be stationary. Therefore, 
by (5.7) 

$$ T = \gamma(q)\,\dot{q}^2 $$

In passing through the equilibrium position, T does not vanish.
Consequently, $\gamma(0)$ is non-zero. Expanding $\gamma(q)$ into a series and
retaining only the zero term of the expansion owing to the smallness
of q, we can write

$$ T = \frac{\mu\dot{q}^2}{2} \tag{16.2} $$

where $\mu = 2\gamma(0)$ (do not confuse it with the reduced mass!).

Let us compile the Lagrangian:

$$ L = \frac{\mu\dot{q}^2}{2} - \frac{\kappa q^2}{2} \tag{16.3} $$

Lagrange's equation

$$ \mu\ddot{q} + \kappa q = 0 \quad\text{or}\quad \ddot{q} + \omega_0^2 q = 0 \tag{16.4} $$

[here $\omega_0^2 = (\kappa/\mu) > 0$] is a linear homogeneous second-order
differential equation with constant coefficients (see Appendix V). Using
the substitution $q = e^{\lambda t}$, we arrive at the characteristic equation¹

$$ \lambda^2 + \omega_0^2 = 0 $$

The roots of this equation are $\lambda_1 = +i\omega_0$ and $\lambda_2 = -i\omega_0$.
Consequently, the general solution has the form

$$ q = C_1 e^{i\omega_0 t} + C_2 e^{-i\omega_0 t} \tag{16.5} $$

where $C_1$ and $C_2$ are complex constants.

The values of q must be real; this signifies that the condition
$q^* = q$ ($q^*$ is the complex conjugate of q) must be observed. Introducing
expression (16.5) for q into this condition, we obtain

$$ C_1^* e^{-i\omega_0 t} + C_2^* e^{i\omega_0 t} = C_1 e^{i\omega_0 t} + C_2 e^{-i\omega_0 t} $$

The above relation is observed if $C_1 = C_2^*$ (correspondingly $C_1^* = C_2$).
Having this in view, we shall write the coefficients $C_1$ and $C_2$ as

$$ C_1 = \frac{a}{2}e^{i\alpha}, \quad C_2 = \frac{a}{2}e^{-i\alpha} \tag{16.6} $$

($a$ and $\alpha$ are arbitrary real constants). The use of these values in
formula (16.5) yields

$$ q = \frac{a}{2}\left(e^{i(\omega_0 t+\alpha)} + e^{-i(\omega_0 t+\alpha)}\right) = a\cos(\omega_0 t + \alpha) \tag{16.7} $$

Therefore, the free motion of the system near the position of 
stable equilibrium has the nature of a harmonic oscillation (natural- 
ly, provided that q remains small in the process of motion). 
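
As an illustration of how $\omega_0$ follows from $\kappa = U''(0)$ and $\mu$, the sketch below estimates the second derivative of a given potential by a central finite difference. It is not part of the original text: the function names are ours, and the simple pendulum used as an example (for which $\mu = ml^2$) is an assumed test case.

```python
import math


def second_derivative(f, x, h=1e-4):
    """Central finite-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


def natural_frequency(U, mu):
    """omega_0 = sqrt(kappa / mu) with kappa = U''(0), as in Eq. (16.4)."""
    kappa = second_derivative(U, 0.0)
    if kappa <= 0:
        raise ValueError("q = 0 is not a position of stable equilibrium")
    return math.sqrt(kappa / mu)


# Assumed example: a simple pendulum, q being the angle of deflection
m, g, l = 1.0, 9.81, 2.0


def pendulum_potential(q):
    return m * g * l * (1.0 - math.cos(q))


omega0 = natural_frequency(pendulum_potential, mu=m * l ** 2)
```

For the pendulum the estimate should agree with the familiar value $\sqrt{g/l}$ to within the accuracy of the finite difference.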

It is known from the general course of physics that $a$ is called the
amplitude, $\alpha$ the initial phase of the oscillation, and $\omega_0$ the
natural frequency of the system².

Let us transform expression (16.7) according to the formula for
the cosine of a sum:

$$ q = a(\cos\alpha\cos\omega_0 t - \sin\alpha\sin\omega_0 t) $$

and introduce the notation

$$ c_1 = a\cos\alpha, \quad c_2 = -a\sin\alpha $$

The solution of Eq. (16.4) can thus be written as

$$ q = c_1\cos\omega_0 t + c_2\sin\omega_0 t \tag{16.8} $$

where $c_1$ and $c_2$ are real constants whose values are determined from
the initial conditions [from $q_0$ and $(\dot{q})_0$].

1 See formulas (V.7) and (V.9).

2 As a rule, we shall not give the information on a question being considered
that can be found in textbooks of general physics.

Finally, we shall give another form of writing a harmonic
oscillation¹:

$$ q = \operatorname{Re}\{\hat{q}\} = \operatorname{Re}\{\hat{A}e^{i\omega_0 t}\} \tag{16.9} $$

where

$$ \hat{A} = ae^{i\alpha} \tag{16.10} $$

is a complex amplitude; its magnitude equals the ordinary amplitude, 
and its argument equals the initial phase of the oscillation. Introduc- 
ing the value of A from (16.10) into (16.9) and taking the real part 
of the expression obtained, we arrive at formula (16.7). 

Consequently, a harmonic oscillation can be represented in the 
form of any of the three formulas (16.7), (16.8), or (16.9). 
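
The equivalence of the three forms can be confirmed numerically. The following short sketch (function names and sample values are our own choice) evaluates (16.7), (16.8) and (16.9) for the same $a$, $\alpha$ and $\omega_0$.

```python
import cmath
import math


def q_amplitude_phase(t, a, alpha, omega0):
    """Form (16.7): q = a cos(omega0 t + alpha)."""
    return a * math.cos(omega0 * t + alpha)


def q_cos_sin(t, a, alpha, omega0):
    """Form (16.8) with c1 = a cos(alpha), c2 = -a sin(alpha)."""
    c1, c2 = a * math.cos(alpha), -a * math.sin(alpha)
    return c1 * math.cos(omega0 * t) + c2 * math.sin(omega0 * t)


def q_complex(t, a, alpha, omega0):
    """Form (16.9) with the complex amplitude (16.10)."""
    A_hat = a * cmath.exp(1j * alpha)
    return (A_hat * cmath.exp(1j * omega0 * t)).real


# Illustrative values
a, alpha, omega0 = 0.3, 0.7, 2.0
samples = [(q_amplitude_phase(t, a, alpha, omega0),
            q_cos_sin(t, a, alpha, omega0),
            q_complex(t, a, alpha, omega0)) for t in (0.0, 0.5, 1.3, 4.0)]
```

Each triple in `samples` should contain three values that agree up to rounding.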

17. Damped Oscillations 

In a real oscillatory system, forces act that retard the motion of
the system and lead to a gradual attenuation of the amplitudes 
(damping) of the oscillations. The mechanical energy of the system 
transforms into the internal energy of the system and the surroundings 
(for brevity, the energy is usually said to transform into heat, but 
this is not quite strict). Such a process is called the dissipation of 
energy. 

We shall limit ourselves to a treatment of cases when the gener- 
alized force of friction retarding a system is proportional to the 
generalized velocity of the system: 

$$ Q^* = -r\dot{q} $$

This is a non-potential force, therefore Lagrange’s equation will 
have the form of (4.15), and the function (16.3) must be taken as L. 
Hence, damped oscillations are described by the equation 

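$$ \mu\ddot{q} + \kappa q = -r\dot{q} $$

(the symbols $\mu$ and $\kappa$ have the same meaning as in the preceding
section). Let us write this equation in the form

$$ \ddot{q} + 2\beta\dot{q} + \omega_0^2 q = 0 \tag{17.1} $$

where

$$ \omega_0^2 = \frac{\kappa}{\mu} > 0 \quad\text{and}\quad 2\beta = \frac{r}{\mu} $$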

The substitution $q = e^{\lambda t}$ leads to the characteristic equation

$$ \lambda^2 + 2\beta\lambda + \omega_0^2 = 0 \tag{17.2} $$

1 We shall use a cap over a symbol to designate complex quantities.

Provided that $\beta^2 < \omega_0^2$, the roots of the characteristic equation are
complex:

$$ \lambda_1 = -\beta + i\sqrt{\omega_0^2 - \beta^2}, \quad \lambda_2 = -\beta - i\sqrt{\omega_0^2 - \beta^2} $$

The general solution of Eq. (17.1) is

$$ q = C_1 e^{\lambda_1 t} + C_2 e^{\lambda_2 t} = e^{-\beta t}\left(C_1 e^{i\omega t} + C_2 e^{-i\omega t}\right) $$

where $\omega = \sqrt{\omega_0^2 - \beta^2}$. The solution we have found differs from the
function (16.5) in the factor $e^{-\beta t}$ and in the substitution of $\omega$ for $\omega_0$.

The requirement that q be real leads to the condition $C_1 = C_2^*$.
Introducing the notation (16.6) and performing elementary
transformations, we arrive at an expression for damped oscillations:

$$ q = ae^{-\beta t}\cos(\omega t + \alpha) \tag{17.3} $$

When $\beta^2 > \omega_0^2$, the roots of characteristic equation (17.2) are real:

$$ \lambda_1 = -\beta + \sqrt{\beta^2 - \omega_0^2} = -\alpha_1, \quad \lambda_2 = -\beta - \sqrt{\beta^2 - \omega_0^2} = -\alpha_2 $$

(since $\sqrt{\beta^2 - \omega_0^2} < \beta$, the quantity $\alpha_1$ is positive; the quantity $\alpha_2$
is also positive, and $\alpha_2 > \alpha_1$). The solution in this case is

$$ q = C_1 e^{-\alpha_1 t} + C_2 e^{-\alpha_2 t} \tag{17.4} $$

where $C_1$ and $C_2$ are real constants.

Hence, with strong friction (when $\beta^2 > \omega_0^2$), no oscillations
appear: the system brought out from its equilibrium position
returns to it asymptotically. The motion of the system may have
the nature described either by curve 1 or curve 2 (Fig. 17.1). In the
latter case, the system first passes through the equilibrium position,
deviates to the other side of it, and only then approaches the equilib- 
rium position asymptotically. Such motion of a system is called 
aperiodic damping (or an aperiodic process). 
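
The two regimes can be told apart directly from the roots of characteristic equation (17.2). The sketch below is an illustration only; the function names and the string labels are our own, and the complex square root from Python's standard `cmath` module is used so that one formula covers both cases.

```python
import cmath


def characteristic_roots(beta, omega0):
    """Roots of lambda**2 + 2*beta*lambda + omega0**2 = 0, Eq. (17.2)."""
    d = cmath.sqrt(beta ** 2 - omega0 ** 2)
    return -beta + d, -beta - d


def regime(beta, omega0):
    """Classify the motion by comparing beta with omega0."""
    if beta < omega0:
        return "damped oscillations"   # complex roots, solution (17.3)
    if beta > omega0:
        return "aperiodic motion"      # real negative roots, solution (17.4)
    return "multiple roots"            # beta equal to omega0


# Illustrative values
weak = characteristic_roots(1.0, 2.0)     # beta < omega0
strong = characteristic_roots(3.0, 2.0)   # beta > omega0
```

For weak friction the imaginary parts of the roots are $\pm\omega = \pm\sqrt{\omega_0^2 - \beta^2}$; for strong friction both roots are real and negative, i.e. they equal $-\alpha_1$ and $-\alpha_2$.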

How a system will return to its equilibrium position (following 
curve 1 or curve 2) depends on the ratio of the coefficients $C_1$ and $C_2$
which, in turn, is determined by the initial conditions [i.e. by the
values of the generalized coordinate $q_0$ and the generalized velocity
$v_0 = (\dot{q})_0$ at the initial instant].

Let us establish the conditions in which aperiodic motion has a
specific nature. We express the coefficients $C_1$ and $C_2$ in terms of
$q_0$ and $v_0$. Assuming in (17.4) that t = 0, we get

$$ q_0 = C_1 + C_2 \tag{17.5} $$

Differentiating (17.4) with respect to time and assuming that t = 0
in the expression obtained, we find that

$$ v_0 = (\dot{q})_0 = -\alpha_1 C_1 - \alpha_2 C_2 \tag{17.6} $$

It follows from Eqs. (17.5) and (17.6) that

$$ C_1 = \frac{\alpha_2 q_0 + v_0}{\alpha_2 - \alpha_1}, \quad C_2 = -\frac{\alpha_1 q_0 + v_0}{\alpha_2 - \alpha_1} \tag{17.7} $$

We equate expression (17.4) to zero:

$$ C_1 e^{-\alpha_1 t} + C_2 e^{-\alpha_2 t} = 0 \tag{17.8} $$

When aperiodic damping occurs according to curve 2 (see Fig. 17.1),
Eq. (17.8) must have a finite positive solution. Solving this equation
for t, we obtain

$$ t = \frac{1}{\alpha_2 - \alpha_1}\ln\left(-\frac{C_2}{C_1}\right) = \frac{1}{\alpha_2 - \alpha_1}\ln\frac{\alpha_1 q_0 + v_0}{\alpha_2 q_0 + v_0} $$

[we have introduced the values of $C_1$ and $C_2$ from (17.7)]. The
difference $\alpha_2 - \alpha_1$ is greater than zero (see above). Therefore, t will be
positive when the expression inside the logarithm symbol is greater
than +1. The latter condition is observed if the expressions
$(\alpha_1 q_0 + v_0)$ and $(\alpha_2 q_0 + v_0)$ have the same signs and, in addition,
the magnitude of the first expression is greater than that of the
second one:

$$ \operatorname{sgn}(\alpha_1 q_0 + v_0) = \operatorname{sgn}(\alpha_2 q_0 + v_0), \quad |\alpha_1 q_0 + v_0| > |\alpha_2 q_0 + v_0| \tag{17.9} $$

The coefficients $\alpha_1$ and $\alpha_2$ are positive, and $\alpha_2 > \alpha_1$. Hence, for
satisfying the second of conditions (17.9), $q_0$ and $v_0$ must have
different signs. This occurs if the initial velocity is directed towards the
equilibrium position [when the system is deflected to the right
($q_0 > 0$), the velocity is directed to the left ($v_0 < 0$), and vice versa].
Figure 17.2 shows graphs of the functions $y = \alpha_1 q_0 + v_0$ and
$y = \alpha_2 q_0 + v_0$. The graphs have been plotted for $q_0 > 0$, therefore
$v_0 < 0$. The values taken on by $q_0$ are divided into three regions.
It is easy to see that both conditions (17.9) are satisfied only in
region I, i.e. at $q_0$'s not exceeding $-v_0/\alpha_2$. In region II, the first of
the conditions is not observed, and in region III, the second one.

Hence, aperiodic damping occurs in accordance with curve 2 (see
Fig. 17.1) when $v_0$ and $q_0$ have different signs and, in addition,

$$ |q_0| < \frac{|v_0|}{\alpha_2} \quad\text{or}\quad |v_0| > \alpha_2|q_0| \tag{17.10} $$

(we remind our reader that $\alpha_2 = \beta + \sqrt{\beta^2 - \omega_0^2}$).
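
The instant at which the system passes through the equilibrium position can be computed from the formulas above. The following is a minimal sketch assuming $\beta > \omega_0 > 0$; the function names and the sample initial conditions are ours.

```python
import math


def overdamped_crossing_time(q0, v0, beta, omega0):
    """Positive root t of Eq. (17.8), or None if q(t) never passes through zero.

    Assumes beta > omega0 > 0 (real, distinct roots of Eq. (17.2)).
    """
    s = math.sqrt(beta ** 2 - omega0 ** 2)
    a1, a2 = beta - s, beta + s              # alpha_1 < alpha_2, both positive
    num, den = a1 * q0 + v0, a2 * q0 + v0
    if den == 0 or num / den <= 1.0:         # the logarithm's argument must exceed 1
        return None
    return math.log(num / den) / (a2 - a1)


def q_overdamped(t, q0, v0, beta, omega0):
    """Solution (17.4) with the coefficients (17.7)."""
    s = math.sqrt(beta ** 2 - omega0 ** 2)
    a1, a2 = beta - s, beta + s
    c1 = (a2 * q0 + v0) / (a2 - a1)
    c2 = -(a1 * q0 + v0) / (a2 - a1)
    return c1 * math.exp(-a1 * t) + c2 * math.exp(-a2 * t)


# Illustrative values giving alpha_1 close to 1 and alpha_2 close to 5
beta, omega0 = 3.0, math.sqrt(5.0)
t_cross = overdamped_crossing_time(1.0, -10.0, beta, omega0)  # |v0| > alpha_2 |q0|
t_none = overdamped_crossing_time(1.0, -2.0, beta, omega0)    # |v0| < alpha_2 |q0|
```

In the first call condition (17.10) holds and a positive crossing time is returned; in the second call it does not hold and the function returns `None`, which corresponds to curve 1.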

Fig. 17.2.

Special attention must be given to the case when characteristic
equation (17.2) has multiple roots. This occurs provided that
$\beta^2 = \omega_0^2$. Consequently, $\lambda_1 = \lambda_2 = -\beta$. According to formula (V.11),
in this case the general solution of Eq. (17.1) is

$$ q = C_1 e^{-\beta t} + C_2 t e^{-\beta t} = (C_1 + C_2 t)e^{-\beta t} $$

After the relevant calculations, we find that

$$ C_1 = q_0 \quad\text{and}\quad C_2 = \beta q_0 + v_0 $$

From the condition q = 0, we get (except for $t = \infty$) the value

$$ t = -\frac{C_1}{C_2} = -\frac{q_0}{\beta q_0 + v_0} $$

It will be positive if

$$ \frac{q_0}{\beta q_0 + v_0} < 0 \quad\text{or}\quad \frac{\beta q_0}{\beta q_0 + v_0} < 0 $$

(multiplication by $\beta$ does not change the sign of a quantity because
$\beta > 0$). The last condition is observed when the sign of $v_0$ is opposite
to that of $q_0$ and, in addition,

$$ |v_0| > \beta|q_0| \tag{17.11} $$

Hence, with multiple roots, aperiodic damping can also occur
either monotonically (see curve 1 in Fig. 17.1) or with passing
through the equilibrium position (see curve 2 in Fig. 17.1). The
latter case occurs if a system brought out of its equilibrium position
by $q_0$ receives an impetus towards the equilibrium position,
imparting a sufficiently high initial velocity to it [a velocity satisfying
condition (17.11); when the roots are different, the velocity must
comply with condition (17.10)].
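
For multiple roots the crossing time follows from $t = -C_1/C_2$. A short sketch (the function name and the sample values are our assumptions):

```python
def critical_crossing_time(q0, v0, beta):
    """Positive zero of q(t) = (C1 + C2 t) exp(-beta t), or None.

    Uses C1 = q0 and C2 = beta*q0 + v0, valid when beta**2 == omega0**2.
    """
    c1, c2 = q0, beta * q0 + v0
    if c2 == 0:
        return None          # q(t) = q0 exp(-beta t) never vanishes
    t = -c1 / c2
    return t if t > 0 else None


# Illustrative values with beta = 2
t_fast = critical_crossing_time(1.0, -3.0, 2.0)   # |v0| > beta |q0|
t_slow = critical_crossing_time(1.0, -1.0, 2.0)   # |v0| < beta |q0|
```

Here `t_fast` is a finite positive time because condition (17.11) is satisfied, whereas `t_slow` is `None`.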

18. Forced Oscillations 

Assume that the system treated in the preceding section experiences 
the periodically changing external generalized force 

$$ Q^* = Q_0\cos(\omega t + \alpha) \tag{18.1} $$

which we shall call the driving force for brevity's sake. Lagrange's
equation (4.15) therefore becomes

$$ \mu\ddot{q} + \kappa q = -r\dot{q} + Q_0\cos(\omega t + \alpha) $$

Let us transform it to the form

$$ \ddot{q} + 2\beta\dot{q} + \omega_0^2 q = f_0\cos(\omega t + \alpha) \tag{18.2} $$

where $f_0 = Q_0/\mu$; the other quantities are explained in the preceding
sections.

We have arrived at a linear non-homogeneous differential equation 
with constant coefficients. According to theorem (V.6), we can obtain 
its general solution by adding a particular solution of Eq. (18.2) 
to the general solution of the corresponding homogeneous equation, 
i.e. to the function (17.3). To find the particular solution, let us 
proceed in accordance with what is said at the end of Appendix V, 
namely, let us add the imaginary function $if_0\sin(\omega t + \alpha)$ to the
right-hand side of (18.2) and seek the complex solution $\hat{q}$ of the
equation obtained; after finding $\hat{q}$, we shall take its real part, and
the latter will be the solution of Eq. (18.2). Hence, we shall solve
the equation

$$ \ddot{\hat{q}} + 2\beta\dot{\hat{q}} + \omega_0^2\hat{q} = f_0[\cos(\omega t + \alpha) + i\sin(\omega t + \alpha)] $$

Its right-hand side can be written as

$$ f_0 e^{i(\omega t+\alpha)} = \hat{f}_0 e^{i\omega t} $$

where

$$ \hat{f}_0 = f_0 e^{i\alpha} \tag{18.3} $$

is the complex amplitude¹ of the driving force (more exactly, the
force divided by $\mu$, but for brevity's sake we shall call $Q^*/\mu$ simply
a force). The differential equation written in the new notation will be

$$ \ddot{q} + 2\beta\dot{q} + \omega_0^2 q = \hat{f}_0 e^{i\omega t} \tag{18.4} $$

(we have omitted the cap over q to avoid complicated symbols).

1 Compare with (16.10).

We shall seek the solution of Eq. (18.4) in the form

$$ q = \hat{a}e^{i\omega t} \tag{18.5} $$

where $\hat{a}$ is the complex amplitude of the oscillation. Differentiation
with respect to t yields

$$ \dot{q} = i\omega\hat{a}e^{i\omega t}, \quad \ddot{q} = (i\omega)^2\hat{a}e^{i\omega t} = -\omega^2\hat{a}e^{i\omega t} \tag{18.6} $$

We see that in the complex representation of harmonically varying
quantities time differentiation consists in multiplying the quantities
by $i\omega$ (and integration, in dividing them by $i\omega$).

Introducing expressions (18.5) and (18.6) into Eq. (18.4) and
cancelling the common factor $e^{i\omega t}$, we obtain the equation

$$ -\omega^2\hat{a} + 2i\beta\omega\hat{a} + \omega_0^2\hat{a} = \hat{f}_0 $$

from which we find

$$ \hat{a} = \frac{\hat{f}_0}{(\omega_0^2 - \omega^2) + 2i\beta\omega} $$

We represent the complex number in the denominator as

$$ (\omega_0^2 - \omega^2) + 2i\beta\omega = \rho e^{i\varphi} \tag{18.7} $$

where $\rho$ is the modulus and $\varphi$ is the argument of this number.
Therefore,

$$ \hat{a} = \frac{\hat{f}_0}{\rho e^{i\varphi}} = \frac{\hat{f}_0}{\rho}e^{-i\varphi} \tag{18.8} $$

It follows from (18.7)¹ that

$$ \rho = \sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}, \quad \tan\varphi = \frac{2\beta\omega}{\omega_0^2 - \omega^2} \tag{18.9} $$

Using the values of $\rho$ and $\hat{f}_0$ [see (18.3)] in (18.8), we get the
following expression for the complex amplitude:

$$ \hat{a} = \frac{f_0}{\sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}}\,e^{i(\alpha-\varphi)} = ae^{i(\alpha-\varphi)} $$

Finally, introducing the value of $\hat{a}$ into formula (18.5), we find the
complex expression for q:

$$ \hat{q} = ae^{i(\omega t+\alpha-\varphi)} $$

1 Recall that a complex number can be depicted by a point P on a plane.
The abscissa x of this point equals the real part of the number, and the ordinate
y equals its imaginary part. The modulus of a complex number equals the
modulus $\rho$ of the position vector of the point P, while the argument $\varphi$ is the angle
made by the position vector and the axis of abscissas. It thus follows that
$\rho = \sqrt{x^2 + y^2}$ and $\tan\varphi = y/x$.

Its real part coincides with the expression for steady-state forced
oscillations known from the general course of physics:

$$ q = \frac{f_0}{\sqrt{(\omega_0^2 - \omega^2)^2 + 4\beta^2\omega^2}}\cos(\omega t + \alpha - \varphi) \tag{18.10} $$

(in textbooks of general physics, it is usually assumed that $\alpha = 0$).

We obtain the general solution of Eq. (18.2) by summation of the 
functions (17.3) and (18.10). We shall not stop off to analyse this 
solution and consider the phenomenon of resonance because this is 
done in sufficient detail in general courses of physics. 
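
The complex-amplitude method lends itself to a direct numerical check. The sketch below (function names and parameter values are our own assumptions) computes $\hat{a}$ from Eqs. (18.3) and (18.8) and then verifies that the real part of $\hat{a}e^{i\omega t}$ satisfies Eq. (18.2), using the rule that time differentiation amounts to multiplication by $i\omega$.

```python
import cmath
import math


def complex_amplitude(f0, alpha, omega, omega0, beta):
    """a_hat = f_hat_0 / ((omega0**2 - omega**2) + 2i*beta*omega)."""
    f_hat = f0 * cmath.exp(1j * alpha)                      # Eq. (18.3)
    return f_hat / complex(omega0 ** 2 - omega ** 2, 2.0 * beta * omega)


def steady_state(t, f0, alpha, omega, omega0, beta):
    """Return (q, q_dot, q_ddot) of the steady-state forced oscillation."""
    q_hat = complex_amplitude(f0, alpha, omega, omega0, beta) * cmath.exp(1j * omega * t)
    return q_hat.real, (1j * omega * q_hat).real, (-(omega ** 2) * q_hat).real


def residual(t, f0, alpha, omega, omega0, beta):
    """Left-hand side minus right-hand side of Eq. (18.2)."""
    q, q_dot, q_ddot = steady_state(t, f0, alpha, omega, omega0, beta)
    return q_ddot + 2.0 * beta * q_dot + omega0 ** 2 * q - f0 * math.cos(omega * t + alpha)


# Illustrative parameters
params = dict(f0=2.0, alpha=0.3, omega=1.5, omega0=2.0, beta=0.25)
a_hat = complex_amplitude(**params)
rho = math.sqrt((params["omega0"] ** 2 - params["omega"] ** 2) ** 2
                + 4.0 * params["beta"] ** 2 * params["omega"] ** 2)   # Eq. (18.9)
residuals = [residual(t, **params) for t in (0.0, 0.7, 3.1)]
```

The modulus of `a_hat` should equal $f_0/\rho$, and the entries of `residuals` should vanish up to rounding. Note that `cmath.phase` of the denominator gives $\varphi$ in the correct quadrant, which the formula for $\tan\varphi$ alone does not.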


19. Oscillations of a System with Many Degrees 
of Freedom 1 

Consider a conservative system with s degrees of freedom and having 
a position of stable equilibrium. In this position, the potential 
energy of the system $U = U(q_1, q_2, \ldots, q_s)$ has a minimum. We
shall measure the generalized coordinates $q_i$ from the equilibrium
position. Bearing in mind that we shall limit ourselves to
small-amplitude oscillations, let us expand the potential energy in
powers of $q_i$, disregarding the terms of the higher orders of smallness:

$$ U = U_0 + \sum_i\left(\frac{\partial U}{\partial q_i}\right)_0 q_i + \frac{1}{2}\sum_{i,k}\left(\frac{\partial^2 U}{\partial q_i\,\partial q_k}\right)_0 q_i q_k $$

In the equilibrium position, all the generalized forces
$Q_i = -(\partial U/\partial q_i)_0$ vanish. We also assume the energy $U_0$ to vanish.
The expression for the potential energy can thus be written as

$$ U = \frac{1}{2}\sum_{i,k}\kappa_{ik}q_i q_k \tag{19.1} $$

where

$$ \kappa_{ik} = \kappa_{ki} = \left(\frac{\partial^2 U}{\partial q_i\,\partial q_k}\right)_0 $$

are constant coefficients (at a minimum, the diagonal coefficients $\kappa_{ii}$
are positive; the mixed ones may have either sign). Since U is measured
from its minimum value taken as zero, quadratic form (19.1) is
positive definite.

With stationary constraints, the kinetic energy is determined by
a positive definite quadratic form of the variables $\dot{q}_i$ [see (5.10)]:

$$ T = \frac{1}{2}\sum_{i,k}\mu_{ik}\dot{q}_i\dot{q}_k \tag{19.2} $$

where

$$ \mu_{ik} = \gamma_{ik}(0) $$

are the zero terms of the expansion of the coefficients $\gamma_{ik}(q)$. By
formula (5.8), $\gamma_{ik} = \gamma_{ki}$, therefore $\mu_{ik} = \mu_{ki}$.

1 Before beginning to read this section, acquaint yourself with Appendices
VII, VIII, and IX.

Subtraction of expression (19.1) from (19.2) yields the Lagrangian:

$$ L = \frac{1}{2}\sum_{i,k}\mu_{ik}\dot{q}_i\dot{q}_k - \frac{1}{2}\sum_{i,k}\kappa_{ik}q_i q_k \tag{19.3} $$

To find the derivatives of L with respect to $\dot{q}_i$ and $q_i$, we write the
expression of the total differential of the function (19.3):

$$ dL = \frac{1}{2}\sum_{i,k}\mu_{ik}\dot{q}_i\,d\dot{q}_k + \frac{1}{2}\sum_{i,k}\mu_{ik}\dot{q}_k\,d\dot{q}_i - \frac{1}{2}\sum_{i,k}\kappa_{ik}q_i\,dq_k - \frac{1}{2}\sum_{i,k}\kappa_{ik}q_k\,dq_i $$

The subscripts i and k are dummy ones, therefore any letter may be
used for either of them. Taking advantage of this, let us exchange the
places of the subscripts i and k in the first and third sums:

$$ dL = \frac{1}{2}\sum_{i,k}\mu_{ki}\dot{q}_k\,d\dot{q}_i + \frac{1}{2}\sum_{i,k}\mu_{ik}\dot{q}_k\,d\dot{q}_i - \frac{1}{2}\sum_{i,k}\kappa_{ki}q_k\,dq_i - \frac{1}{2}\sum_{i,k}\kappa_{ik}q_k\,dq_i = \sum_{i,k}\mu_{ik}\dot{q}_k\,d\dot{q}_i - \sum_{i,k}\kappa_{ik}q_k\,dq_i $$

(recall that $\mu_{ki} = \mu_{ik}$ and $\kappa_{ki} = \kappa_{ik}$). The expression we have
obtained can be written as

$$ dL = \sum_i d\dot{q}_i\left(\sum_k\mu_{ik}\dot{q}_k\right) - \sum_i dq_i\left(\sum_k\kappa_{ik}q_k\right) \tag{19.4} $$

i h i k 


In an expression for the total differential of a function of several 
variables, the factor of a differential of a variable equals the partial 
derivative of the function with respect to this variable. It thus 
follows from (19.4) that 


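$$ \frac{\partial L}{\partial\dot{q}_i} = \sum_k\mu_{ik}\dot{q}_k, \quad \frac{\partial L}{\partial q_i} = -\sum_k\kappa_{ik}q_k $$

Since the quantities $\mu_{ik}$ are constants, the derivative
$\frac{d}{dt}\frac{\partial L}{\partial\dot{q}_i} = \sum_k\mu_{ik}\ddot{q}_k$. Hence, Lagrange's equations have the form

$$ \sum_k\mu_{ik}\ddot{q}_k + \sum_k\kappa_{ik}q_k = 0 \quad (i = 1, 2, \ldots, s) \tag{19.5} $$

[compare with Eq. (16.4) for one variable].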

We have arrived at a system of linear homogeneous differential
equations with constant coefficients. Let us attempt to seek the
unknown functions $q_k(t)$ in the form [compare with (16.5)]

$$ q_k = C_k e^{i\omega t} \tag{19.6} $$

where $C_k$ are complex constants that have to be determined. The
functions (19.6) are complex, whereas the generalized coordinates
are real. Consequently, upon completing our calculations, we shall
have to take the real parts of functions (19.6) (see Appendix V).
The introduction of expressions (19.6) into Eq. (19.5) yields

$$ \sum_k\mu_{ik}(-\omega^2)C_k e^{i\omega t} + \sum_k\kappa_{ik}C_k e^{i\omega t} = 0 \quad (i = 1, 2, \ldots, s) $$

Cancelling $e^{i\omega t}$ in all the equations, we obtain

$$ \sum_k(\kappa_{ik} - \omega^2\mu_{ik})C_k = 0 \tag{19.7} $$

ft 


We have arrived at a system of s linear homogeneous algebraic 
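equations with the unknowns $C_1, C_2, \ldots, C_s$. For this system to have
a non-zero solution, it is necessary and sufficient that its determinant
be zero:

$$ \begin{vmatrix} \kappa_{11} - \omega^2\mu_{11} & \kappa_{12} - \omega^2\mu_{12} & \ldots & \kappa_{1s} - \omega^2\mu_{1s} \\ \kappa_{21} - \omega^2\mu_{21} & \kappa_{22} - \omega^2\mu_{22} & \ldots & \kappa_{2s} - \omega^2\mu_{2s} \\ \ldots & \ldots & \ldots & \ldots \\ \kappa_{s1} - \omega^2\mu_{s1} & \kappa_{s2} - \omega^2\mu_{s2} & \ldots & \kappa_{ss} - \omega^2\mu_{ss} \end{vmatrix} = 0 \tag{19.8} $$

[see the text following formula (VIII.26) in Appendix VIII].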

Equation (19.8) is known as a characteristic equation. It is an
equation of degree s relative to $\omega^2$. In the general case, this equation
has s different¹ real positive roots: $\omega_1^2, \omega_2^2, \ldots, \omega_s^2$. The quantities
$\omega_\alpha$ ($\alpha = 1, 2, \ldots, s$) found in this way are called the natural
frequencies of the system.
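
For s = 2 the characteristic equation (19.8) is a quadratic in $\omega^2$ and can be solved in closed form. The following sketch is an illustration only: the function name and the sample matrices are hypothetical, and both matrices are assumed symmetric and positive definite.

```python
import math


def natural_frequencies_2dof(kappa, mu):
    """Solve det(kappa - omega**2 * mu) = 0 for a system with s = 2.

    kappa and mu are symmetric 2x2 matrices given as nested tuples.
    Returns (omega_1, omega_2) with omega_1 <= omega_2.
    """
    (k11, k12), (_, k22) = kappa
    (m11, m12), (_, m22) = mu
    # Expanding the determinant gives A*lam**2 - B*lam + C = 0, lam = omega**2
    A = m11 * m22 - m12 ** 2
    B = k11 * m22 + k22 * m11 - 2.0 * k12 * m12
    C = k11 * k22 - k12 ** 2
    # Guard against a tiny negative discriminant caused by rounding
    disc = math.sqrt(max(B * B - 4.0 * A * C, 0.0))
    lam1, lam2 = (B - disc) / (2.0 * A), (B + disc) / (2.0 * A)
    return math.sqrt(lam1), math.sqrt(lam2)


# Hypothetical coefficients chosen so that the roots are omega**2 = 10 and 11
kappa = ((10.5, -0.5), (-0.5, 10.5))
mu = ((1.0, 0.0), (0.0, 1.0))
omega_1, omega_2 = natural_frequencies_2dof(kappa, mu)
```

For larger s the same problem is a generalized eigenvalue problem for the pair of matrices $\kappa_{ik}$, $\mu_{ik}$ and is normally handed to a linear-algebra library.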

Let us prove that the roots of Eq. (19.8) are real and positive. 
For this purpose, we multiply each of Eqs. (19.7) by $C_i^*$ (i.e. by a
quantity that is the complex conjugate of the coefficient $C_i$) and
then sum all the equations. The result is

$$ \sum_{i,k}(\kappa_{ik} - \omega^2\mu_{ik})C_i^* C_k = 0 $$

or

$$ \sum_{i,k}\kappa_{ik}C_i^* C_k - \omega^2\sum_{i,k}\mu_{ik}C_i^* C_k = 0 $$

whence

$$ \omega^2 = \frac{\sum_{i,k}\kappa_{ik}C_i^* C_k}{\sum_{i,k}\mu_{ik}C_i^* C_k} \tag{19.9} $$


1 Multiple roots may be obtained in particular cases.

The numerator and denominator of Eq. (19.9) contain quadratic
forms like (IX.21). It is shown in Appendix IX that such a form
equals the sum of the quadratic forms $\sum_{i,k}\kappa_{ik}a_i a_k + \sum_{i,k}\kappa_{ik}b_i b_k$
and correspondingly $\sum_{i,k}\mu_{ik}a_i a_k + \sum_{i,k}\mu_{ik}b_i b_k$ ($a_i$ is the real part,
and $b_i$ is the imaginary part of $C_i$). The latter forms, in turn, are,
first, evidently real and, second, positive definite [see (19.1) and
(19.2)]. We have thus proved that the numerator and denominator of
Eq. (19.9) and, consequently, $\omega^2$ are real and positive.
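
The reality and positiveness of the ratio in Eq. (19.9) can be observed on a small example. In the sketch below the function name, the matrices and the complex vector are our own illustrative choices; the matrices are assumed real, symmetric and positive definite. Note that the ratio equals a squared natural frequency only when the $C_k$ solve Eqs. (19.7); the sketch merely shows that it is real and positive for any non-zero complex $C_k$.

```python
def quadratic_form_ratio(kappa, mu, C):
    """Right-hand side of Eq. (19.9) for a vector C of complex numbers."""
    n = len(C)
    num = sum(kappa[i][k] * C[i].conjugate() * C[k]
              for i in range(n) for k in range(n))
    den = sum(mu[i][k] * C[i].conjugate() * C[k]
              for i in range(n) for k in range(n))
    return num / den


# Hypothetical symmetric positive definite matrices and an arbitrary complex vector
kappa = ((10.5, -0.5), (-0.5, 10.5))
mu = ((2.0, 0.5), (0.5, 1.0))
ratio = quadratic_form_ratio(kappa, mu, [1.0 + 2.0j, -0.5 + 1.0j])
```

The imaginary part of `ratio` should vanish up to rounding, because the imaginary contributions of the terms with subscripts (i, k) and (k, i) cancel for symmetric coefficients.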

Thus, having solved the characteristic equation (19.8), we find s
natural frequencies of the system: $\omega_1, \omega_2, \ldots, \omega_s$. Introducing in
turn the values $\omega_\alpha$ into the system of equations (19.7) and solving
the system, we find the $C_k$'s corresponding to different $\omega_\alpha$'s. If the
matrix of the system (19.7) has the rank s - 1 (which is usually the
case), by (VIII.28) the solutions of the system are

$$ C_k^{(\alpha)} = c_\alpha\Delta_{mk}^{(\alpha)} $$

where $c_\alpha$ is an arbitrary complex constant, and $\Delta_{mk}^{(\alpha)}$ is the signed
minor of the element $\kappa_{mk} - \omega_\alpha^2\mu_{mk}$ in the determinant of the system
(m is chosen arbitrarily but with at least one $\Delta_{mk}^{(\alpha)}$ being non-zero).
Since all the elements of this determinant are real, the quantities
$\Delta_{mk}^{(\alpha)}$ are also real.

Hence, for each generalized coordinate $q_k$, we obtain s different
solutions of the form

$$ q_k^{(\alpha)} = c_\alpha\Delta_{mk}^{(\alpha)}e^{i\omega_\alpha t} \quad (\alpha = 1, 2, \ldots, s) \tag{19.10} $$

where $\Delta_{mk}^{(\alpha)}$ are real constants determined by the values of the
coefficients $\kappa_{ik}$ and $\mu_{ik}$, and also of the frequencies $\omega_\alpha$.

We obtain the general solution by summation of all expressions
(19.10):

$$ q_k = \sum_\alpha c_\alpha\Delta_{mk}^{(\alpha)}e^{i\omega_\alpha t} $$

Passing over to the real part of this expression, we obtain

$$ q_k = \operatorname{Re}\left\{\sum_\alpha c_\alpha\Delta_{mk}^{(\alpha)}e^{i\omega_\alpha t}\right\} = \sum_\alpha\Delta_{mk}^{(\alpha)}\operatorname{Re}\{c_\alpha e^{i\omega_\alpha t}\} $$

Finally, representing $c_\alpha$ as $a_\alpha e^{i\delta_\alpha}$ (here $a_\alpha$ is the modulus of $c_\alpha$,
i.e. a real positive quantity), we arrive at the expression

$$ q_k = \sum_{\alpha=1}^{s}\Delta_{mk}^{(\alpha)}a_\alpha\cos(\omega_\alpha t + \delta_\alpha) \tag{19.11} $$

a=i 

Consequently, the change in each generalized coordinate $q_k$ with
time is the superposition of s harmonic oscillations whose frequencies
equal the natural frequencies of the system. The quantities $a_\alpha$ and
$\delta_\alpha$ are determined from the initial conditions.

Expressions (19.11) can be greatly simplified with a special
selection of the generalized coordinates. It is shown in Appendix IX that
when we have two quadratic forms, one from the variables $\dot{q}_k$ and
the other from the variables $q_k$, the first of them being positive
definite, a linear transformation of the variables $q_k$ exists that
reduces both forms to a diagonal type [see diagram (IX.37)]. Let us
pass over with the aid of such a transformation from the variables $q_k$
to the variables $\xi_k$. The quadratic forms (19.1) and (19.2) will now
become diagonal:

$$ U = \frac{1}{2}\sum_k\kappa_k\xi_k^2, \quad T = \frac{1}{2}\sum_k\dot{\xi}_k^2 \tag{19.12} $$

The Lagrangian will be as follows in the new variables:

$$ L = \frac{1}{2}\sum_k\dot{\xi}_k^2 - \frac{1}{2}\sum_k\kappa_k\xi_k^2 $$

and Lagrange's equations will be

$$ \ddot{\xi}_k + \kappa_k\xi_k = 0 \quad (k = 1, 2, \ldots, s) \tag{19.13} $$

The equations of motion in the coordinates $\xi_k$ thus split up into s
independent equations each of which is identical to Eq. (16.4). We
must note that owing to the positive definiteness of the quadratic
form for the potential energy U, all the coefficients $\kappa_k$ are positive.
They can therefore be written as

$$ \kappa_k = \omega_k^2 $$

where $\omega_k$ are real quantities.

Let us write the solutions of Eqs. (19.13):

$$ \xi_k = a_k\cos(\omega_k t + \delta_k) \quad (k = 1, 2, \ldots, s) \tag{19.14} $$

[see formula (16.7)].

We have found that the generalized coordinates $\xi_k$ perform a simple
harmonic oscillation independently of one another, and each with
its own frequency $\omega_k$. The generalized coordinates determined in
this way are called normal (or principal), and the simple harmonic
oscillations they perform are called normal oscillations of the system.

We must note that the normal coordinates $\xi_k$ are related to the
arbitrary generalized coordinates $q_k$ by means of linear
transformations, i.e. transformations of the form

$$ \xi_k = \sum_j a_{kj}q_j \tag{19.15} $$

Hence, $\xi_k$ can be obtained as a linear combination of the
coordinates $q_j$.

20. Coupled Pendulums

Consider the small-amplitude oscillations of a system consisting
of two identical simple pendulums connected by a weightless spring
(Fig. 20.1). Assume that the pendulums can oscillate only in the
plane of the drawing so that the system has two degrees of freedom.
We choose $\varphi_1$ and $\varphi_2$, the angles of deflection of the pendulums from
the vertical direction, as the generalized coordinates. The length of
each pendulum is l and its mass is m. The ends of the spring are
fastened on the pendulum rods at a distance b from the point of
suspension. The spring is chosen so that when $\varphi_1 = \varphi_2$, its
tension is zero.

Let us write an expression for the potential energy U of the system,
assuming that U is zero in the equilibrium position:

$$ U = mgl(1 - \cos\varphi_1) + mgl(1 - \cos\varphi_2) + \frac{1}{2}k(b\sin\varphi_2 - b\sin\varphi_1)^2 $$

For small-amplitude oscillations, we can assume that $\sin\varphi \approx \varphi$
and $\cos\varphi = \sqrt{1 - \sin^2\varphi} \approx \sqrt{1 - \varphi^2} \approx 1 - \frac{1}{2}\varphi^2$. The expression
for U thus becomes

$$ U = \frac{1}{2}mgl\varphi_1^2 + \frac{1}{2}mgl\varphi_2^2 + \frac{1}{2}kb^2(\varphi_2 - \varphi_1)^2 = \frac{1}{2}\left[(mgl + kb^2)\varphi_1^2 - kb^2\varphi_1\varphi_2 - kb^2\varphi_2\varphi_1 + (mgl + kb^2)\varphi_2^2\right] \tag{20.1} $$

The kinetic energy in the same approximation is 

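$$ T = \frac{1}{2}\left[ml^2\dot{\varphi}_1^2 + ml^2\dot{\varphi}_2^2\right] \tag{20.2} $$

A comparison of expressions (20.1) and (20.2) with expressions
(19.1) and (19.2) yields the following values for the coefficients
$\kappa_{ik}$ and $\mu_{ik}$:

$$ \kappa_{11} = \kappa_{22} = mgl + kb^2, \quad \kappa_{12} = \kappa_{21} = -kb^2, \quad \mu_{11} = \mu_{22} = ml^2, \quad \mu_{12} = \mu_{21} = 0 \tag{20.3} $$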

The introduction of these values of the coefficients into Eq. (19.5)
leads to the differential equations

$$ \begin{aligned} ml^2\ddot{\varphi}_1 + (mgl + kb^2)\varphi_1 - kb^2\varphi_2 &= 0 \\ ml^2\ddot{\varphi}_2 - kb^2\varphi_1 + (mgl + kb^2)\varphi_2 &= 0 \end{aligned} \tag{20.4} $$

We shall seek the solutions of these equations in the form

$$ \varphi_1 = C_1 e^{i\omega t} \quad\text{and}\quad \varphi_2 = C_2 e^{i\omega t} \tag{20.5} $$

Let us substitute these expressions into Eq. (20.4). After cancelling
$e^{i\omega t}$ and combining similar terms, we get a system of equations for
determining the constants $C_1$ and $C_2$:

$$ \begin{aligned} (mgl + kb^2 - ml^2\omega^2)C_1 - kb^2 C_2 &= 0 \\ -kb^2 C_1 + (mgl + kb^2 - ml^2\omega^2)C_2 &= 0 \end{aligned} \tag{20.6} $$

For this system to have a non-zero solution, its determinant must
equal zero:

$$ \begin{vmatrix} mgl + kb^2 - ml^2\omega^2 & -kb^2 \\ -kb^2 & mgl + kb^2 - ml^2\omega^2 \end{vmatrix} = 0 $$

i.e. the following condition must be satisfied:

$$ (mgl + kb^2 - ml^2\omega^2)^2 - (-kb^2)^2 = 0 $$

The latter equation can be written as follows after simple
transformations:

$$ \omega^4 - 2\left(\frac{g}{l} + \frac{kb^2}{ml^2}\right)\omega^2 + \frac{g}{l}\left(\frac{g}{l} + \frac{2kb^2}{ml^2}\right) = 0 $$

We have arrived at a quadratic equation relative to $\omega^2$. The roots
of this equation are

$$ \omega_1^2 = \frac{g}{l} \quad\text{and}\quad \omega_2^2 = \frac{g}{l} + \frac{2kb^2}{ml^2} $$

Consequently, the natural frequencies of the system will be

$$ \omega_1 = \sqrt{\frac{g}{l}} \quad\text{and}\quad \omega_2 = \sqrt{\frac{g}{l} + \frac{2kb^2}{ml^2}} \tag{20.7} $$
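
Formulas (20.7) can be checked against the determinant condition for system (20.6). In the sketch below the function names, the default value of g and the sample parameters are our own assumptions.

```python
import math


def coupled_pendulum_frequencies(m, l, k, b, g=9.81):
    """Natural frequencies (20.7) of two identical pendulums coupled by a spring."""
    omega1 = math.sqrt(g / l)
    omega2 = math.sqrt(g / l + 2.0 * k * b ** 2 / (m * l ** 2))
    return omega1, omega2


def determinant(omega, m, l, k, b, g=9.81):
    """Determinant of system (20.6) for a trial frequency omega."""
    d = m * g * l + k * b ** 2 - m * l ** 2 * omega ** 2
    return d * d - (k * b ** 2) ** 2


# Illustrative parameters (SI units assumed)
m, l, k, b = 0.5, 1.2, 3.0, 0.4
omega1, omega2 = coupled_pendulum_frequencies(m, l, k, b)
det1 = determinant(omega1, m, l, k, b)
det2 = determinant(omega2, m, l, k, b)
```

Both `det1` and `det2` should vanish up to rounding. The first frequency does not depend on the spring at all, while the second grows with the stiffness k and with the distance b.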

Let us introduce the square of the first natural frequency, i.e.
$\omega_1^2$, into Eq. (20.6) instead of $\omega^2$. After simplification, system (20.6)
becomes

$$ \begin{aligned} kb^2 C_1 - kb^2 C_2 &= 0 \\ -kb^2 C_1 + kb^2 C_2 &= 0 \end{aligned} $$

The solutions of this system are obvious:

$$ C_1 = C_2 = c_1 = a_1 e^{i\delta_1} \tag{20.8} $$

where $c_1$ is an arbitrary complex constant, $a_1$ is its modulus, and
$\delta_1$ is its argument.

The introduction of (20.8) into (20.5) yields complex solutions of
differential equations (20.4) corresponding to the frequency $\omega_1$:

$$ \begin{aligned} \varphi_1^{(1)} &= c_1 e^{i\omega_1 t} = a_1 e^{i(\omega_1 t+\delta_1)} \\ \varphi_2^{(1)} &= c_1 e^{i\omega_1 t} = a_1 e^{i(\omega_1 t+\delta_1)} \end{aligned} $$






Taking the real part of the functions we have found, we obtain

$$\varphi_1^{(1)} = a_1 \cos(\omega_1 t + \delta_1), \quad \varphi_2^{(1)} = a_1 \cos(\omega_1 t + \delta_1) \tag{20.9}$$

Now let us introduce the square of the second natural frequency,
i.e. $\omega_2^2$, into Eq. (20.6) instead of $\omega^2$. The result is

$$
\begin{aligned}
-kb^2 C_1 - kb^2 C_2 &= 0 \\
-kb^2 C_1 - kb^2 C_2 &= 0
\end{aligned}
$$

The system is satisfied by the values

$$C_1 = -C_2 = c_2 = a_2 e^{i\delta_2}$$

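The functions (20.5) will accordingly be

$$
\begin{aligned}
\varphi_1^{(2)} &= c_2 e^{i\omega_2 t} = a_2 e^{i(\omega_2 t + \delta_2)} \\
\varphi_2^{(2)} &= -c_2 e^{i\omega_2 t} = -a_2 e^{i(\omega_2 t + \delta_2)}
\end{aligned}
$$

and their real parts will be

$$
\begin{aligned}
\varphi_1^{(2)} &= a_2 \cos(\omega_2 t + \delta_2) \\
\varphi_2^{(2)} &= -a_2 \cos(\omega_2 t + \delta_2)
\end{aligned}
\tag{20.10}
$$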


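We obtain a general solution of system (20.4) by the summation of
solutions (20.9) and (20.10). Consequently,

$$
\begin{aligned}
\varphi_1 &= \varphi_1^{(1)} + \varphi_1^{(2)} = a_1 \cos(\omega_1 t + \delta_1) + a_2 \cos(\omega_2 t + \delta_2) \\
\varphi_2 &= \varphi_2^{(1)} + \varphi_2^{(2)} = a_1 \cos(\omega_1 t + \delta_1) - a_2 \cos(\omega_2 t + \delta_2)
\end{aligned}
\tag{20.11}
$$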

Let us go over from the generalized coordinates $\varphi_1$ and $\varphi_2$ to the
new variables $\xi_1$ and $\xi_2$, which we shall determine as follows:

$$\xi_1 = \tfrac{1}{2}(\varphi_1 + \varphi_2) \quad \text{and} \quad \xi_2 = \tfrac{1}{2}(\varphi_1 - \varphi_2)$$

With a view to (20.11), we obtain

$$
\begin{aligned}
\xi_1 &= a_1 \cos(\omega_1 t + \delta_1) \\
\xi_2 &= a_2 \cos(\omega_2 t + \delta_2)
\end{aligned}
\tag{20.12}
$$

The variables $\xi_1$ and $\xi_2$ are thus normal coordinates of the system of
coupled pendulums being considered. The generalized coordinates
$\varphi_1$ and $\varphi_2$ are expressed in terms of $\xi_1$ and $\xi_2$ with the aid of the
linear equations

$$\varphi_1 = \xi_1 + \xi_2 \quad \text{and} \quad \varphi_2 = \xi_1 - \xi_2 \tag{20.13}$$

Assume that only the first normal oscillation is performed in the
system. This signifies that $\xi_2 \equiv 0$. Inspection of (20.13) shows that
in this case

$$\varphi_1 = \varphi_2 = \xi_1 = a_1 \cos(\omega_1 t + \delta_1)$$

i.e. both pendulums oscillate like a single whole with the frequency
$\omega_1$, being at each instant deflected to the same side through the
same angle (Fig. 20.2a). The spring is not deformed so that each
pendulum oscillates as if the other one were absent ($\omega_1 = \sqrt{g/l}$).

Now assume that only the second normal oscillation is being performed
in the system. A glance at (20.13) shows that in this case

$$\varphi_1 = -\varphi_2 = \xi_2 = a_2 \cos(\omega_2 t + \delta_2)$$

At each instant, the pendulums are deflected through an identical angle,
but in opposite directions (Fig. 20.2b).

The connection between the pendulums can be characterized with
the aid of the spring constant k. Let us call the latter the coupling
coefficient. Consider the case of a weak connection, i.e. a small k.
If $(k/m) \ll (g/l)$, the difference between the natural frequencies will
be much smaller than the frequencies themselves:

$$\omega_2 - \omega_1 \ll \omega_1 \tag{20.14}$$



(a) (b)

Fig. 20.2.


Let us deflect the first pendulum through the angle $\varphi_{10} = a$, keeping
the second one at its zero position. Now let the system oscillate.
The initial conditions in this case will obviously be

$$\varphi_{10} = a, \quad \varphi_{20} = 0, \quad (\dot\varphi_1)_0 = 0, \quad (\dot\varphi_2)_0 = 0$$


We shall find the values of the constants $a_1$, $a_2$, $\delta_1$, and $\delta_2$. For
this purpose, we assume that t = 0 in (20.11). The result is

$$
\begin{aligned}
\varphi_{10} &= a = a_1 \cos\delta_1 + a_2 \cos\delta_2 \\
\varphi_{20} &= 0 = a_1 \cos\delta_1 - a_2 \cos\delta_2
\end{aligned}
\tag{20.15}
$$


Now let us differentiate expressions (20.11) with respect to time
and assume that t = 0 in the formulas obtained. This leads to the
expressions

$$
\begin{aligned}
(\dot\varphi_1)_0 &= 0 = -a_1\omega_1 \sin\delta_1 - a_2\omega_2 \sin\delta_2 \\
(\dot\varphi_2)_0 &= 0 = -a_1\omega_1 \sin\delta_1 + a_2\omega_2 \sin\delta_2
\end{aligned}
\tag{20.16}
$$

Solving Eqs. (20.15) and (20.16) simultaneously, we find that

$$a_1 = a_2 = \frac{a}{2} \quad \text{and} \quad \delta_1 = \delta_2 = 0$$

Hence, in the case being considered, the oscillations have the
form

$$
\begin{aligned}
\varphi_1 &= \frac{a}{2}(\cos\omega_1 t + \cos\omega_2 t) = a \cos\frac{\omega_2 - \omega_1}{2}t \cdot \cos\frac{\omega_2 + \omega_1}{2}t \\
\varphi_2 &= \frac{a}{2}(\cos\omega_1 t - \cos\omega_2 t) = a \sin\frac{\omega_2 - \omega_1}{2}t \cdot \sin\frac{\omega_2 + \omega_1}{2}t
\end{aligned}
$$

With a weak coupling, we have $(\omega_2 - \omega_1) \ll (\omega_2 + \omega_1)$ [see
(20.14)]. We may thus consider that each of the pendulums performs
harmonic oscillation at the frequency $(\omega_2 + \omega_1)/2 \approx \omega_1$ with a
slowly varying amplitude. Hence, the motion of each of the pendulums
has the nature of beats. The amplitudes change with a phase shift of
$\pi/2$. When the amplitude of one of the pendulums reaches its maximum
value, the amplitude of the second one vanishes, and vice versa.
In the process of oscillations, energy is pumped, as it were, from one
pendulum to the other.

When only one normal oscillation, $\xi_1$ or $\xi_2$, is produced, no transition
of energy from one pendulum to the other occurs.
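
The beat picture can be illustrated numerically. The sketch below evaluates the two pendulum angles as sums of the normal oscillations with equal amplitudes a/2 and zero phases, compares them with the product ("fast oscillation times slow amplitude") form, and checks that the two slowly varying amplitudes are shifted by a quarter of the slow period. The values of a, ω1 and ω2 are assumptions chosen only so that the two frequencies are close.

```python
import math

def pendulum_angles(t, a, w1, w2):
    """Angles for the initial conditions phi1 = a, phi2 = 0, zero initial velocities."""
    phi1 = 0.5 * a * (math.cos(w1 * t) + math.cos(w2 * t))
    phi2 = 0.5 * a * (math.cos(w1 * t) - math.cos(w2 * t))
    return phi1, phi2

def beat_form(t, a, w1, w2):
    """The same angles written as a fast oscillation with a slowly varying amplitude."""
    slow = 0.5 * (w2 - w1) * t
    fast = 0.5 * (w2 + w1) * t
    return a * math.cos(slow) * math.cos(fast), a * math.sin(slow) * math.sin(fast)

def slow_amplitudes(t, a, w1, w2):
    """Magnitudes of the slowly varying amplitudes of the two pendulums."""
    slow = 0.5 * (w2 - w1) * t
    return abs(a * math.cos(slow)), abs(a * math.sin(slow))

# Assumed sample values: weak coupling, so w2 is only slightly larger than w1.
a, w1, w2 = 0.1, 3.13, 3.29

# Largest discrepancy between the two ways of writing the solution on a time grid.
max_diff = max(
    abs(x - y)
    for i in range(2001)
    for x, y in zip(pendulum_angles(0.01 * i, a, w1, w2),
                    beat_form(0.01 * i, a, w1, w2))
)

# Instant at which the slow phase reaches pi/2: the first pendulum is momentarily
# at rest in amplitude while the second one swings with the full amplitude a.
t_swap = math.pi / (w2 - w1)
amp1_at_swap, amp2_at_swap = slow_amplitudes(t_swap, a, w1, w2)
print(max_diff, amp1_at_swap, amp2_at_swap)
```

At every instant the squares of the two slow amplitudes add up to a², which is one way of seeing that what one pendulum loses the other gains.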




Chapter V 


MECHANICS OF A RIGID 
BODY 


21. Kinematics of a Rigid Body 


By dividing a continuous rigid body into elementary volumes of
mass $\rho\,dV$ (here $\rho$ is the density of the body), it can be represented
as a system of particles with rigid constraints.

A rigid body is known to have six degrees of freedom: three translational
and three rotational ones. To describe the motion of a rigid
body, let us choose the inertial reference frame K (with the axes
$X_1, X_2, X_3$) which we shall consider to be stationary. We shall
rigidly connect to the body another frame K' (with the axes $x_1, x_2, x_3$)
and place its origin at the point A of the body. It is convenient to
take the three coordinates of the origin of the frame K' (the position
vector $\mathbf{R}_A$ corresponds to them) and the three angles characterizing
the orientation of the axes $x_1, x_2, x_3$ relative to the axes $X_1, X_2, X_3$
as the generalized coordinates determining the position of the body.
These axes make nine angles with one another, but only three of
them are independent; the other six can be expressed through the
values of the first three¹. It is customary practice to use the Euler
angles $\varphi, \vartheta, \psi$ (see Sec. 22) as the three angles determining the
mutual orientation of the axes of the frames K and K'.

Any elementary displacement of a rigid body can be represented
as the sum of a translational displacement when all its points are
displaced over the same distance $d\mathbf{R}_A$ and rotation through the angle
$d\boldsymbol{\varphi}$ about an axis passing through the point A.

Since the velocities $\mathbf{v}$ of the points of the body in the frame K'
are zero, formula (15.3) for the velocity of a point whose position
in the frame K' is determined by the position vector $\mathbf{r}_{(A)}$² acquires
the form

$$\mathbf{V} = \mathbf{V}_A + [\boldsymbol{\omega}_{(A)}, \mathbf{r}_{(A)}] \tag{21.1}$$

where $\mathbf{V}_A$ is the translational velocity of the body (the velocity of
the point A observed in the frame K), and $\boldsymbol{\omega}_{(A)} = d\boldsymbol{\varphi}/dt$ is the
angular velocity of rotation of the body about an axis passing through
the point A. The first term in this formula is the same for all the
points of a body, the second is a position function.

¹ There are six relations between the cosines of these angles [see formula (VI.39)]:

$$\sum_m \alpha_{im}\alpha_{km} = \delta_{ik} \quad (i, k = 1, 2, 3;\ i \le k)$$

[$\alpha_{ik} = \cos(x_i, X_k)$].

² Let us agree on notation. In this chapter, we shall use indices of two kinds:
(1) without parentheses, and (2) in parentheses. Indices without parentheses
will indicate a particle or point which the given quantity relates to. For
instance, $m_\alpha$ is the mass of a particle whose number is $\alpha$, $\mathbf{r}_\alpha$ is the position vector
of the same particle, and $\mathbf{R}_A$ is the position vector of the point A.

Indices in parentheses will indicate the point from which a position vector
emerges, or the point relative to which a moment or angular momentum is
calculated, etc. Depending on the circumstances, we shall use these indices either
as subscripts or superscripts on the relevant symbol. For example, $\mathbf{r}_{(A)}$ or $\mathbf{r}^{(A)}$
will stand for a position vector emerging from the point A; $\mathbf{M}_{(A)}$ or $\mathbf{M}^{(A)}$, for
the angular momentum relative to the point A. The absence of an index in
parentheses at the symbol $\mathbf{r}$ or $\mathbf{M}$ will signify that the relevant quantity is
taken relative to the centre of mass C of the body. Hence, we shall designate a
position vector emerging from the point C either by the symbol $\mathbf{r}_{(C)}$ or simply
by $\mathbf{r}$, the angular momentum relative to the point C, by the symbols $\mathbf{M}_{(C)}$ and
$\mathbf{M}$, and so on.

We shall use the symbol $\mathbf{R}$ only in one quite definite case: to denote a position
vector emerging from the origin of the stationary reference frame K (with
the axes X, Y, Z). Therefore, there is no need to use an index in parentheses in
the given case, and we shall not write it.

We shall use lower case letters ($\mathbf{r}$, x, y, z, etc.) to denote position vectors
drawn from the origin of the reference frame K' rigidly connected to a body,
the coordinates in the frame K', etc.

If we had placed the origin of the frame K' at another point of
the body, say at a point B, formula (21.1) would be as follows:

$$\mathbf{V} = \mathbf{V}_B + [\boldsymbol{\omega}_{(B)}, \mathbf{r}_{(B)}] \tag{21.2}$$

where $\mathbf{V}_B$ is the velocity of the point B observed in the frame K,
and $\boldsymbol{\omega}_{(B)}$ is the angular velocity of rotation of the body about an
axis passing through the point B.

The position of an arbitrary point of the body in the frame K
is determined by the same position vector in both cases:

$$\mathbf{R} = \mathbf{R}_A + \mathbf{r}_{(A)} = \mathbf{R}_B + \mathbf{r}_{(B)}$$

It thus follows that the position vector $\mathbf{r}_{(B)}$ can be represented as

$$\mathbf{r}_{(B)} = \mathbf{a} + \mathbf{r}_{(A)} \tag{21.3}$$

where $\mathbf{a} = \mathbf{R}_A - \mathbf{R}_B$ is a position vector from the point B to the
point A, i.e. a quantity not depending on which point of the body
we write formula (21.3) for.

Using the value given by (21.3) in formula (21.2), we obtain

$$\mathbf{V} = \mathbf{V}_B + [\boldsymbol{\omega}_{(B)}, \mathbf{a}] + [\boldsymbol{\omega}_{(B)}, \mathbf{r}_{(A)}] \tag{21.4}$$

The first two terms on the right-hand side of (21.4) are identical
for all the points of the body, while the third term is a position
function.

Formulas (21.1) and (21.4) determine the same quantity: the
velocity of the point of the body being considered in the frame K.
Consequently, at any $\mathbf{r}_{(A)}$, the right-hand sides of these formulas
must coincide. This is possible provided that

$$\mathbf{V}_A = \mathbf{V}_B + [\boldsymbol{\omega}_{(B)}, \mathbf{a}] \tag{21.5}$$

$$[\boldsymbol{\omega}_{(A)}, \mathbf{r}_{(A)}] \equiv [\boldsymbol{\omega}_{(B)}, \mathbf{r}_{(A)}] \tag{21.6}$$

[the identity sign stresses that equality must hold at any values of
$\mathbf{r}_{(A)}$].

A glance at identity (21.6) shows that

$$\boldsymbol{\omega}_{(A)} = \boldsymbol{\omega}_{(B)}$$

i.e. that the angular velocity of rotation about any axis is the same,
and we can speak simply of the angular velocity $\boldsymbol{\omega}$ of the body regardless
of our choice of the reference frame K'. The translational velocity,
as can be seen from relation (21.5), does not have an absolute
nature, however; it depends on the position of the origin of the
frame K' (that is, $\mathbf{V}_A \ne \mathbf{V}_B$).

Suppressing the superfluous subscript on $\boldsymbol{\omega}$, let us write relation
(21.5) as follows:

$$\mathbf{V}_B = \mathbf{V}_A - [\boldsymbol{\omega}\mathbf{a}] \tag{21.7}$$
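
A small numerical illustration may help here. The sketch below computes the velocity of one and the same point of the body twice: once with the origin of the body frame at A, and once with the origin at B, where the velocity of B is obtained from the velocity of A by relation (21.7) and the same angular velocity is used in both cases. All vectors are given by their components in the stationary frame K; the helper names and the numerical values are assumptions made only for this example.

```python
def cross(u, v):
    """Vector product [u v] of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def add(u, v):
    return tuple(x + y for x, y in zip(u, v))

def sub(u, v):
    return tuple(x - y for x, y in zip(u, v))

def velocity(V_origin, omega, r):
    """V = V_origin + [omega, r]: velocity of the point at r from the chosen origin."""
    return add(V_origin, cross(omega, r))

# Assumed sample data.
R_A = (1.0, 0.0, 2.0)     # position of the point A in the frame K
R_B = (-0.5, 1.5, 0.0)    # position of the point B in the frame K
R_P = (2.0, 1.0, -1.0)    # position of an arbitrary point P of the body
V_A = (0.3, -0.2, 0.1)    # velocity of the point A
omega = (0.0, 1.0, 2.0)   # angular velocity of the body

a = sub(R_A, R_B)                   # vector from B to A
V_B = sub(V_A, cross(omega, a))     # velocity of B from the relation V_B = V_A - [omega a]

V_via_A = velocity(V_A, omega, sub(R_P, R_A))
V_via_B = velocity(V_B, omega, sub(R_P, R_B))
print(V_via_A, V_via_B)
```

Both descriptions give the same velocity for P, although the translational velocities of A and B differ.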

Two cases are possible: (1) the vectors $\mathbf{V}_A$ and $\boldsymbol{\omega}$ are mutually
perpendicular, and (2) the vectors $\mathbf{V}_A$ and $\boldsymbol{\omega}$ make an angle differing
from $\pi/2$. It is easy to see that in the first case the vectors $\mathbf{V}_A$ and
$[\boldsymbol{\omega}\mathbf{a}]$ are coplanar (both are perpendicular to $\boldsymbol{\omega}$). Consequently, the
vectors $\mathbf{V}_B$ and $\mathbf{V}_A$ are also coplanar. Hence, the vector $\mathbf{V}_B$, like the
vector $\mathbf{V}_A$, will be perpendicular to the vector $\boldsymbol{\omega}$. This allows us to make
the following conclusion: if the vectors $\mathbf{V}_A$ and $\boldsymbol{\omega}$ are mutually
perpendicular with our choice of the origin of the frame K', these vectors
will also be mutually perpendicular at any other choice of the origin of the
frame K' (with any other choice of the point A).

Let us now turn to formula (21.1) and write it as

$$\mathbf{V} = \mathbf{V}_A + [\boldsymbol{\omega}, \mathbf{r}_{(A)}] \tag{21.8}$$

This formula shows that when the vectors $\mathbf{V}_A$ and $\boldsymbol{\omega}$ are mutually
perpendicular (which, if this occurs, is observed with any choice of
the point A), the vectors $\mathbf{V}$ and $\mathbf{V}_A$ will be coplanar, and the velocities
$\mathbf{V}$ of all the points of a body are in planes perpendicular to the
vector $\boldsymbol{\omega}$. By varying our choice of the point A, we can find a position
of it for which

$$\mathbf{V}_A = \mathbf{V} - [\boldsymbol{\omega}, \mathbf{r}_{(A)}] \tag{21.9}$$

vanishes¹ (here the point A may be outside the body). As a result,
the motion of a solid will be represented as only rotation about an
axis called the instantaneous axis of rotation of the body [see (21.8)].

¹ Both terms on the right-hand side of (21.9) are position functions of points
of the body ($\mathbf{V}$ is the velocity of a point of the body in the frame K, and
$\mathbf{r}_{(A)}$ is the position vector of this point in the frame K'). The difference of these
terms for all the points of the body is the same and equals $\mathbf{V}_A$.

When the vectors $\mathbf{V}_A$ and $\boldsymbol{\omega}$ are not perpendicular to each other,
we can choose the point A so that these vectors will be collinear.
Consequently, the motion of the body at each instant is the superposition
of two motions: rotation about an axis at the angular velocity
$\boldsymbol{\omega}$ and translational motion at the velocity $\mathbf{V}_A$ along the same
axis. We shall not stop to prove this statement.
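
Although the statement is left unproved in the text, it is easy to check numerically. The sketch below uses a standard construction that is an assumption of this example and is not taken from the text: moving the origin from A by the offset c = [ω, V_A]/ω² gives a point whose velocity is directed along ω, and which is at rest when V_A is perpendicular to ω. The function names and sample vectors are hypothetical.

```python
def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    """Vector product [u v] of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def shift_to_axis(V_A, omega):
    """Offset c = [omega, V_A] / omega^2 from A to a point of the sought axis."""
    w2 = dot(omega, omega)
    return tuple(x / w2 for x in cross(omega, V_A))

def velocity_at(V_A, omega, r):
    """V = V_A + [omega, r]: velocity of the point displaced by r from A."""
    return tuple(v + w for v, w in zip(V_A, cross(omega, r)))

# Assumed sample data.
omega = (0.0, 0.0, 2.0)
V_general = (1.0, 0.5, 0.3)   # not perpendicular to omega
V_perp = (1.0, 0.5, 0.0)      # perpendicular to omega

# General case: the velocity of the new origin is collinear with omega.
V_screw = velocity_at(V_general, omega, shift_to_axis(V_general, omega))

# Perpendicular case: the new origin is at rest, i.e. it lies on the
# instantaneous axis of rotation.
V_pure = velocity_at(V_perp, omega, shift_to_axis(V_perp, omega))
print(V_screw, V_pure)
```

For the sample data, the velocity of the new origin keeps only its component along ω, while in the perpendicular case it vanishes altogether.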

We must note that the formulas of the dynamics of a rigid body
become especially simple if we make the origin of the frame K'
coincide with the centre of mass C of a body. In the following, we
shall usually proceed in exactly this way. Formula (21.8) will
therefore acquire the form

$$\mathbf{V} = \mathbf{V}_C + [\boldsymbol{\omega}\mathbf{r}] \tag{21.10}$$


22. The Euler Angles 

The Euler angles are determined as follows. Assume that the axes
of the frame K' associated with a body first coincided with the axes
of the frame K. Next the body turned, as a result of which the
orientation of the axes of K' in space changed. Any such rotation
can be performed with the aid of the three rotations shown in
Fig. 22.1.

(a) (b) (c)

Fig. 22.1.

1. Rotation about the Z-axis through the angle $\varphi$ (Fig. 22.1a).
The direction n followed by the x-axis is called the nodal line.

2. Rotation about the nodal line through the angle $\vartheta$ (Fig. 22.1b).

3. Rotation about the z-axis through the angle $\psi$ (Fig. 22.1c).

The direction of each of these rotations is related to the direction
of the axis about which it occurs by the right-hand screw rule.

Examination of Fig. 22.2 shows that the nodal line is the line of
intersection of the coordinate planes XY and xy. The angle $\varphi$ is formed
by the X-axis and the nodal line, the angle $\psi$ by the nodal line and
the x-axis, and, finally, the angle $\vartheta$ is the angle between the axes Z
and z. The angles $\vartheta$ and $\varphi$ are the polar coordinates of the point
of intersection of the z-axis and a sphere of unit radius. This
point is known as the apex.

For the set of angles $\varphi$, $\vartheta$, and $\psi$ determining each real rotation
to be unique, it is assumed that the angles $\varphi$ and $\psi$ can have
values from zero to $2\pi$, while the values of the angle $\vartheta$ are limited
to the interval from zero to $\pi$. If the angle $\vartheta$ were also allowed
to have values from 0 to $2\pi$, the rotation depicted in Fig. 22.3,
for instance, could be characterized either by the set of angles
$\varphi = \pi/2$, $\vartheta = \pi/2$, $\psi = 0$ (the upper sequence of rotations; the
axes X, Y, Z are not shown in the figure, their orientation
coincides with the initial orientation of the axes x, y, z) or by the
set $\varphi = 3\pi/2$, $\vartheta = 3\pi/2$, $\psi = \pi$ (the lower sequence of rotations).
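
The ambiguity described above can be confirmed by multiplying out the rotation matrices. The sketch below assumes one common convention, which the text itself does not write out: the matrix taking components in the body frame K' to components in the frame K is the product R_z(φ) R_x(ϑ) R_z(ψ) of elementary rotations about the Z-axis, the nodal line and the z-axis. The function names are hypothetical.

```python
import math

def rot_z(angle):
    """Matrix of a rotation through `angle` about the third axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(angle):
    """Matrix of a rotation through `angle` about the first axis."""
    c, s = math.cos(angle), math.sin(angle)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def matmul(A, B):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def euler_matrix(phi, theta, psi):
    """Assumed convention: R = R_z(phi) R_x(theta) R_z(psi)."""
    return matmul(matmul(rot_z(phi), rot_x(theta)), rot_z(psi))

# The two sets of angles quoted in the text for Fig. 22.3.
R_upper = euler_matrix(math.pi / 2, math.pi / 2, 0.0)
R_lower = euler_matrix(3 * math.pi / 2, 3 * math.pi / 2, math.pi)

max_difference = max(abs(R_upper[i][j] - R_lower[i][j])
                     for i in range(3) for j in range(3))
print(max_difference)
```

The two matrices coincide to rounding error, so the two sets of angles describe one and the same orientation; restricting ϑ to the interval from zero to π removes the second set.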



Fig. 22.2. 



Fig. 22.3. 

Assume that the Z-axis is directed vertically and the frame K'
is rigidly associated with a top (gyroscope), the z-axis coinciding
with the top's axis of proper rotation. It is now a simple matter to
see that a change in the angle $\psi$ corresponds to rotation of the top
itself, a change in the angle $\varphi$ corresponds to rotation of the vertical
plane containing the z-axis, i.e. to precession of the top, and, finally,
a change in the angle $\vartheta$ to motion of the top's axis called nutation¹.
Accordingly, the angle $\varphi$ is called the precession angle, the angle
$\vartheta$ the nutation angle, and the angle $\psi$ the angle of proper rotation
(or the angle of pure rotation)².

The rate of the change in the angle $\varphi$ can be characterized by
the angular velocity vector $\boldsymbol{\omega}_\varphi$ directed along the Z-axis (see
Fig. 22.2); the magnitude of this vector is $\dot\varphi$. Let us resolve the vector
$\boldsymbol{\omega}_\varphi$ into two components, one of which is directed along the z-axis
(its magnitude is $\dot\varphi \cos\vartheta$), and the second is perpendicular to the
z-axis, i.e. is in the plane xy (its magnitude is $\dot\varphi \sin\vartheta$). The second
component is obviously perpendicular to the nodal line n and,
consequently, makes the angles $\pi/2 - \psi$ and $\psi$ with the axes x and
y, respectively. We can conclude from the above that the projections
of the vector $\boldsymbol{\omega}_\varphi$ onto the axes of the frame K' are

$$
\begin{aligned}
(\omega_\varphi)_1 &= \dot\varphi \sin\vartheta \cos(\pi/2 - \psi) = \dot\varphi \sin\vartheta \sin\psi \\
(\omega_\varphi)_2 &= \dot\varphi \sin\vartheta \cos\psi \\
(\omega_\varphi)_3 &= \dot\varphi \cos\vartheta
\end{aligned}
\tag{22.1}
$$


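The rate of the change in the angle $\vartheta$ is characterized by the
vector $\boldsymbol{\omega}_\vartheta$ directed along the nodal line; its magnitude is $\dot\vartheta$. The
nodal line is perpendicular to the z-axis, and makes the angles $\psi$
and $\psi + \pi/2$ with the axes x and y, respectively. Consequently, the
projections of the vector $\boldsymbol{\omega}_\vartheta$ onto the axes of the frame K' are

$$
\begin{aligned}
(\omega_\vartheta)_1 &= \dot\vartheta \cos\psi \\
(\omega_\vartheta)_2 &= \dot\vartheta \cos(\psi + \pi/2) = -\dot\vartheta \sin\psi \\
(\omega_\vartheta)_3 &= 0
\end{aligned}
\tag{22.2}
$$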


Finally, the rate of the change in the angle $\psi$ is characterized
by the vector $\boldsymbol{\omega}_\psi$ directed along the z-axis (its magnitude is $\dot\psi$). The
projections of this vector onto the axes of the frame K' are

$$(\omega_\psi)_1 = 0, \quad (\omega_\psi)_2 = 0, \quad (\omega_\psi)_3 = \dot\psi \tag{22.3}$$

¹ General courses of physics usually deal only with regular precession,
characterized by the angle between the top's axis and a vertical line remaining
unchanged. Actually, as a rule, the top's axis oscillates in the plane Zz about a
certain middle position. This oscillation is what we call nutation.

² The letter $\psi$ is sometimes used to denote the angle of precession, and the
letter $\varphi$ the angle of proper rotation.

The vector of the angular velocity $\boldsymbol{\omega}$ at which the body rotates
relative to the frame K (the body is always stationary relative to the
frame K') can be represented as the sum of the angular velocities of
each of the three rotations corresponding to the changes in the Euler
angles:

$$\boldsymbol{\omega} = \boldsymbol{\omega}_\varphi + \boldsymbol{\omega}_\vartheta + \boldsymbol{\omega}_\psi$$

Therefore, the following values are obtained for the projections of
the angular velocity $\boldsymbol{\omega}$ onto the axes of the frame K' with account
taken of formulas (22.1), (22.2), and (22.3):

$$
\begin{aligned}
\omega_1 &= \dot\varphi \sin\vartheta \sin\psi + \dot\vartheta \cos\psi \\
\omega_2 &= \dot\varphi \sin\vartheta \cos\psi - \dot\vartheta \sin\psi \\
\omega_3 &= \dot\varphi \cos\vartheta + \dot\psi
\end{aligned}
\tag{22.4}
$$

We shall need these formulas in the following.
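
The projections of the angular velocity onto the body axes can be cross-checked against a direct numerical differentiation of the rotation matrix. The sketch below assumes the convention R = R_z(φ) R_x(ϑ) R_z(ψ) for the matrix taking body-frame components to components in K (an assumption, since the text does not write this matrix out) and uses the fact that the product of the transposed matrix and its time derivative is the antisymmetric matrix built from the body-frame components of ω. All names and sample values are hypothetical.

```python
import math

def rot_z(a):
    c, s = math.cos(a), math.sin(a)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(a):
    c, s = math.cos(a), math.sin(a)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(A):
    return [[A[j][i] for j in range(3)] for i in range(3)]

def body_to_space(phi, theta, psi):
    """Assumed convention: R = R_z(phi) R_x(theta) R_z(psi)."""
    return matmul(matmul(rot_z(phi), rot_x(theta)), rot_z(psi))

def omega_formula(phi, theta, psi, dphi, dtheta, dpsi):
    """Body-frame components of the angular velocity from the closed-form expressions."""
    return (dphi * math.sin(theta) * math.sin(psi) + dtheta * math.cos(psi),
            dphi * math.sin(theta) * math.cos(psi) - dtheta * math.sin(psi),
            dphi * math.cos(theta) + dpsi)

def omega_numeric(angles, rates, h=1e-6):
    """Body-frame components read off the antisymmetric matrix R^T dR/dt."""
    plus = body_to_space(*[q + h * dq for q, dq in zip(angles, rates)])
    minus = body_to_space(*[q - h * dq for q, dq in zip(angles, rates)])
    R_dot = [[(plus[i][j] - minus[i][j]) / (2 * h) for j in range(3)]
             for i in range(3)]
    W = matmul(transpose(body_to_space(*angles)), R_dot)
    return (W[2][1], W[0][2], W[1][0])

# Assumed sample angles (radians) and their rates of change.
angles = (0.7, 1.1, -0.4)
rates = (0.5, -0.3, 2.0)

w_formula = omega_formula(*angles, *rates)
w_numeric = omega_numeric(angles, rates)
print(w_formula)
print(w_numeric)
```

Agreement is limited only by the finite-difference step h, so the comparison is made to a modest tolerance rather than to machine precision.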


23. The Inertia Tensor 1 

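Assume that we are observing the motion of a solid in the stationary
reference frame K (whose axes will be designated $X_1, X_2, X_3$, or
X, Y, Z). In accordance with what was stated in Sec. 21, let us
connect the reference frame K' having the axes $x_1, x_2, x_3$ (or x, y, z)
to the body rigidly. We divide the body mentally into particles of
mass $m_\alpha$². According to formula (21.10), the velocity of the $\alpha$-th
particle will be written as follows:

$$\mathbf{V}_\alpha = \mathbf{V}_C + [\boldsymbol{\omega}, \mathbf{r}_\alpha] \tag{23.1}$$

Let us calculate the kinetic energy of the body. It is

$$
\begin{aligned}
T &= \frac{1}{2}\sum_\alpha m_\alpha V_\alpha^2 = \frac{1}{2}\sum_\alpha m_\alpha \{\mathbf{V}_C + [\boldsymbol{\omega}, \mathbf{r}_\alpha]\}^2 \\
  &= \frac{1}{2}\sum_\alpha m_\alpha V_C^2 + \sum_\alpha m_\alpha \mathbf{V}_C [\boldsymbol{\omega}, \mathbf{r}_\alpha] + \frac{1}{2}\sum_\alpha m_\alpha [\boldsymbol{\omega}, \mathbf{r}_\alpha]^2
\end{aligned}
$$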

In the first term on the right-hand side, we can put the factor $V_C^2$
outside the sum sign. This term therefore becomes $\tfrac{1}{2} m V_C^2$, where
$m = \sum_\alpha m_\alpha$ is the mass of the body. In the second term, we shall
perform a cyclic transposition of the factors [see (VI.3)], after which
we shall put the constant factor outside the sum sign. The result
is the expression $[\mathbf{V}_C, \boldsymbol{\omega}] \sum_\alpha m_\alpha \mathbf{r}_\alpha = [\mathbf{V}_C, \boldsymbol{\omega}]\, m\mathbf{r}_C$, where $\mathbf{r}_C$ is the
position vector of the centre of mass C. If, as we have agreed upon,
we place the origin of the frame K' at the point C, the second term
vanishes.

¹ Before beginning to study this section, acquaint yourself with Appendix X.

² The subscript $\alpha$ indicates a particle's number. We use the Latin subscripts
i, k, l, ... to number coordinate axes, vector components, etc.

Hence, the kinetic energy of a rigid body breaks up into two terms.
The first of them,

$$T_{\text{trans}} = \frac{1}{2} m V_C^2 \tag{23.2}$$

is the kinetic energy of translational motion. The second term

$$T_{\text{rot}} = \frac{1}{2}\sum_\alpha m_\alpha [\boldsymbol{\omega}, \mathbf{r}_\alpha]^2 \tag{23.3}$$

is the kinetic energy of rotation.

We must stress the fact that both these energies are absolutely
independent: one depends only on $\mathbf{V}_C$, and the other only on $\boldsymbol{\omega}$.
Since the origin of the frame K' coincides with the point C, the term
containing both $\mathbf{V}_C$ and $\boldsymbol{\omega}$ vanishes.

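Let us transform expression (23.3). First, we shall replace the
square of the vector product in accordance with (VI.6):

$$T_{\text{rot}} = \frac{1}{2}\sum_\alpha m_\alpha \{\omega^2 r_\alpha^2 - (\boldsymbol{\omega}\mathbf{r}_\alpha)^2\} \tag{23.4}$$

We shall now write this expression using the projections of the
vectors $\boldsymbol{\omega}$ and $\mathbf{r}_\alpha$ onto the axes of the frame K'. The projections of
$\mathbf{r}_\alpha$ onto the axes of the frame K' equal simply the coordinates of the
particle $x_{1\alpha}, x_{2\alpha}, x_{3\alpha}$. Let $\omega_1, \omega_2, \omega_3$ stand for the projections of the
vector $\boldsymbol{\omega}$ onto the axes of the frame K'. Expression (23.4) therefore
acquires the following form in the components of the vectors:

$$
\begin{aligned}
T_{\text{rot}} &= \frac{1}{2}\sum_\alpha m_\alpha \Big[ \Big(\sum_i \omega_i^2\Big)\Big(\sum_l x_{l\alpha}^2\Big) - \Big(\sum_i \omega_i x_{i\alpha}\Big)\Big(\sum_k \omega_k x_{k\alpha}\Big) \Big] \\
 &= \frac{1}{2}\sum_\alpha m_\alpha \Big[ \Big(\sum_i \omega_i^2\Big)\Big(\sum_l x_{l\alpha}^2\Big) - \sum_{i,k} \omega_i\omega_k x_{i\alpha} x_{k\alpha} \Big]
\end{aligned}
$$

(we remind our reader that a dummy index can be designated by any
letter).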

In the expression we have obtained, the quantities $\omega_i$ and $\omega_k$
do not depend on the subscript $\alpha$ and they could be put outside the
sign of the sum over $\alpha$. This is prevented by the circumstance, however,
that the first term of the expression includes the sum of the
squares $\omega_i^2$ and the second term, the sum of the products $\omega_i\omega_k$. This
obstacle can be eliminated by replacing the sum of the quantities
$\omega_i^2$ with the expression $\sum_{i,k} \omega_i\omega_k\delta_{ik}$, which is obviously equivalent
to $\sum_i \omega_i^2$. The formula for the rotational energy therefore becomes

$$T_{\text{rot}} = \frac{1}{2}\sum_\alpha m_\alpha \Big[ \Big(\sum_{i,k} \omega_i\omega_k\delta_{ik}\Big)\Big(\sum_l x_{l\alpha}^2\Big) - \sum_{i,k} \omega_i\omega_k x_{i\alpha} x_{k\alpha} \Big] \tag{23.5}$$

(we must note that $\sum_l x_{l\alpha}^2$ is simply a scalar¹ depending on the
subscript $\alpha$; each of the addends $\omega_i\omega_k\delta_{ik}$ is multiplied by this scalar).

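In expression (23.5), summation is first performed over the subscripts
i and k, and then over the subscript $\alpha$. Let us change the
sequence of summation so that summation over the subscripts i
and k will be performed the last, i.e. rewrite (23.5) as follows:

$$T_{\text{rot}} = \frac{1}{2}\sum_{i,k} \omega_i\omega_k \sum_\alpha m_\alpha \Big[ \delta_{ik}\Big(\sum_l x_{l\alpha}^2\Big) - x_{i\alpha} x_{k\alpha} \Big]$$

If we introduce the symbol²

$$I_{ik} = \sum_\alpha m_\alpha \Big[ \delta_{ik}\Big(\sum_l x_{l\alpha}^2\Big) - x_{i\alpha} x_{k\alpha} \Big] \tag{23.6}$$

the expression for the kinetic energy of rotation can be written as

$$T_{\text{rot}} = \frac{1}{2}\sum_{i,k} I_{ik}\,\omega_i\omega_k \tag{23.8}$$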
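
The equivalence of the two ways of writing the rotational energy is easy to check for a handful of particles. The sketch below builds the quantities I_ik particle by particle and compares the double sum over i and k with the direct sum of the squared vector products. The particle data and the function names are assumptions made for illustration; the positions are chosen so that the centre of mass lies at the origin.

```python
def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def inertia_tensor(masses, positions):
    """I_ik = sum over particles of m * (delta_ik * r^2 - x_i * x_k)."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = dot(r, r)
        for i in range(3):
            for k in range(3):
                I[i][k] += m * ((r2 if i == k else 0.0) - r[i] * r[k])
    return I

def t_rot_direct(masses, positions, omega):
    """(1/2) * sum of m * [omega, r]^2 over the particles."""
    return 0.5 * sum(m * dot(cross(omega, r), cross(omega, r))
                     for m, r in zip(masses, positions))

def t_rot_tensor(I, omega):
    """(1/2) * sum over i, k of I_ik * omega_i * omega_k."""
    return 0.5 * sum(I[i][k] * omega[i] * omega[k]
                     for i in range(3) for k in range(3))

# Assumed sample body: three particles with the centre of mass at the origin.
masses = [1.0, 1.0, 2.0]
positions = [(1.0, 0.0, 1.0), (-1.0, 2.0, 1.0), (0.0, -1.0, -1.0)]
omega = (0.3, -0.4, 1.2)

I = inertia_tensor(masses, positions)
T_direct = t_rot_direct(masses, positions, omega)
T_tensor = t_rot_tensor(I, omega)
print(T_direct, T_tensor)
```

The nine numbers I_ik are computed once for the body; afterwards the energy for any angular velocity follows from the double sum alone.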

The quantity determined by formula (23.6) is a number (but not
an invariant!) depending on the subscripts i and k. There are altogether
nine such numbers. It can be seen that the set of quantities
$I_{ik}$ forms a second-rank tensor³. Indeed, the product of the scalar
$\sum_l x_{l\alpha}^2$ and the unit tensor $\delta_{ik}$ is a tensor [see formulas (X.17) and
(X.19)], and the products $x_{i\alpha} x_{k\alpha}$ are products of the components
of the vector $\mathbf{r}_\alpha$, i.e. also a tensor [see (X.16)]. Finally, the difference
of the relevant components of two tensors also gives the components
of a tensor [see (X.18)].

Hence, quantities (23.6) are the components of a tensor. The latter
is called an inertia tensor. Quantities (23.6) do not change when
the subscripts i and k are transposed. Consequently, the inertia
tensor is symmetrical ($I_{ik} = I_{ki}$).

¹ $\sum_l x_{l\alpha}^2 = r_\alpha^2 = \text{inv}$.

² When calculating the quantities $I_{ik}$ for a continuous body, $\rho\,dV$ must be
taken instead of $m_\alpha$ and summation replaced by integration. Hence

$$I_{ik} = \int \Big[ \delta_{ik}\Big(\sum_l x_l^2\Big) - x_i x_k \Big] \rho\,dV \tag{23.7}$$

³ This conclusion can also be arrived at by the following reasoning. Let us
write expression (23.8) in the form

$$T_{\text{rot}} = \frac{1}{2}\sum_i \omega_i \sum_k I_{ik}\,\omega_k$$

This expression can be invariant only if the quantities $\sum_k I_{ik}\omega_k$ are the i-th
components of a vector [see Appendix VI, the text following formula (VI.28)]. The
latter, in turn, is possible only when the quantities $I_{ik}$ are the components of a
tensor [see Appendix X, the text between formulas (X.22) and (X.23)].

Let us write the components of the inertia tensor using the conventional
notation of Cartesian coordinates:

$$
(I_{ik}) =
\begin{pmatrix}
\sum m(y^2 + z^2) & -\sum mxy & -\sum mxz \\
-\sum myx & \sum m(x^2 + z^2) & -\sum myz \\
-\sum mzx & -\sum mzy & \sum m(x^2 + y^2)
\end{pmatrix}
\tag{23.9}
$$

(to avoid making the formulas more complicated, we have suppressed
the subscript $\alpha$ on m and on the coordinates x, y, z; all the sums are
taken over this subscript).

The diagonal components of the tensor are known as axial mo- 
ments of inertia. They coincide with the moments of inertia of 
a body relative to the corresponding coordinate axes known from 
the general course of physics. The non-diagonal components are 
called centrifugal moments of inertia. 

A symmetric tensor can be represented geometrically by an ellipsoid. In the
case being considered, it is an ellipsoid of inertia. The directions
in a body coinciding with the semiaxes of an ellipsoid of inertia
are called its principal axes of inertia. They intersect at the centre
of mass of the body. If we direct the axes of the frame K' (i.e. the
axes x, y, z; we remind our reader that these axes are rigidly connected
to the body) along the principal axes of inertia of a body, the
inertia tensor will be reduced to a diagonal form

$$
(I_{ik}) =
\begin{pmatrix}
I_1 & 0 & 0 \\
0 & I_2 & 0 \\
0 & 0 & I_3
\end{pmatrix}
\tag{23.10}
$$

The values $I_1, I_2, I_3$ of the diagonal components of a tensor
(in the case when it has been reduced to a diagonal form) are
called the principal moments of inertia of a body (they could be
designated by the symbols $I_x, I_y, I_z$).

If the principal axes of inertia have been chosen as the axes x, y, z
associated with a body, expression (23.8) for the kinetic energy of
rotation is simplified as follows:

$$T_{\text{rot}} = \frac{1}{2}\sum_i I_i\,\omega_i^2 \tag{23.11}$$

or

$$T_{\text{rot}} = \frac{1}{2}\left(I_x\omega_x^2 + I_y\omega_y^2 + I_z\omega_z^2\right) \tag{23.12}$$

(do not forget that $\omega_1, \omega_2, \omega_3$ are the projections onto the axes
x, y, z of the vector $\boldsymbol{\omega}$, the angular velocity of rotation of the body
observed in the frame with the axes X, Y, Z).

When the vector $\boldsymbol{\omega}$ coincides with one of the principal axes of
inertia along which we direct, say, the z-axis, the expression for the
energy becomes even more simple:

$$T_{\text{rot}} = \frac{1}{2} I_z\,\omega^2 \tag{23.13}$$

An expression similar to (23.13) is also obtained when a body
rotates about an axis fixed relative to the body and passing through
its centre of mass¹. By directing, say, the z-axis along this axis, we
find that $\omega_x = \omega_y = 0$, and $\omega_z = \omega$. Consequently, of the nine
addends of formula (23.8), only the one in which i = k = z will be
non-zero, so that

$$T_{\text{rot}} = \frac{1}{2} I_{zz}\,\omega_z^2 = \frac{1}{2} I_{zz}\,\omega^2 \tag{23.14}$$

where $I_{zz}$ in the general case is not one of the principal moments of
inertia.

We must note that, for example, for a body such as a homogeneous
sphere, the ellipsoid of inertia degenerates into a sphere. Therefore,
the principal axes of inertia are not fixed relative to the body. This
signifies that any three mutually perpendicular axes passing through
the centre of the sphere can be taken as the principal axes. In this
case, all the principal moments of inertia are the same: $I_1 = I_2 = I_3 = I$,
and the tensor of inertia can be written as

$$(I_{ik}) = I\,(\delta_{ik}) \tag{23.15}$$

where $(\delta_{ik})$ is a unit tensor [see (X.17)], and I is a scalar.

Everything that we have said about a sphere also holds for a
homogeneous cube. Indeed, relation (23.15) evidently holds for it.
Consequently, the ellipsoid of inertia for a cube degenerates into
a sphere. For this reason, any axis (and not only an axis of symmetry)
passing through the centre of a cube may be considered as a principal
axis of inertia. This is why a cube, in addition to a sphere and other
bodies for which $I_1 = I_2 = I_3$, is called a spherical top².
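
The claim about the cube can be illustrated with a rough numerical model. The sketch below replaces the homogeneous cube by a regular lattice of equal point masses (an approximation introduced only for this example), builds the inertia tensor about the centre, and evaluates the moment of inertia about a body diagonal, which is not one of the axes passing through the centres of the faces. The lattice size and all names are assumptions.

```python
import math

def cube_inertia(n=7, side=1.0, mass=1.0):
    """Inertia tensor about the centre for an n x n x n lattice of equal point masses."""
    coords = [side * (j / (n - 1) - 0.5) for j in range(n)]
    dm = mass / n**3
    I = [[0.0] * 3 for _ in range(3)]
    for x in coords:
        for y in coords:
            for z in coords:
                r = (x, y, z)
                r2 = x * x + y * y + z * z
                for i in range(3):
                    for k in range(3):
                        I[i][k] += dm * ((r2 if i == k else 0.0) - r[i] * r[k])
    return I

def moment_about_axis(I, direction):
    """Moment of inertia about the axis through the centre along `direction`."""
    norm = math.sqrt(sum(c * c for c in direction))
    u = [c / norm for c in direction]
    return sum(I[i][k] * u[i] * u[k] for i in range(3) for k in range(3))

I_cube = cube_inertia()
I_face_axis = moment_about_axis(I_cube, (0.0, 0.0, 1.0))   # axis through face centres
I_diagonal = moment_about_axis(I_cube, (1.0, 1.0, 1.0))    # body diagonal
print(I_face_axis, I_diagonal)
```

The off-diagonal components vanish by symmetry and the three diagonal ones coincide, so the moment of inertia is the same about every axis through the centre, as it must be for a spherical top.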


¹ If this axis does not coincide with any of the principal axes of inertia, it
must be retained in place with the aid of bearings.

² For a spherical top, the energy is always expressed by formula (23.13),
where by $I_z$ we must understand the scalar factor I in expression (23.15) for
the inertia tensor.

A body for which two principal moments of inertia are equal (for
instance, $I_1 = I_2 \ne I_3$) is called a symmetrical top. Finally, a body
for which all three principal moments of inertia are different is called
an asymmetrical top.

Up to now, in considering the inertia tensor, we assumed that the
origin of the frame K' associated with a body is at the centre of
mass C of the body. The inertia tensor can also be determined by
formula (23.6), however, with respect to the frame $K'_{(A)}$ associated
with a body and having its origin at an arbitrary point A. The tensor
components in this case will be

$$I_{ik}^{(A)} = \sum_\alpha m_\alpha \Big\{ \delta_{ik}\Big(\sum_l \big(x_{l\alpha}^{(A)}\big)^2\Big) - x_{i\alpha}^{(A)} x_{k\alpha}^{(A)} \Big\} \tag{23.16}$$

The values of $I_{ik}^{(A)}$ are related by simple expressions to the values
$I_{ik}$ of the tensor components determined with respect to the frame $K'_{(C)}$
with its origin at the point C and with axes parallel to those of the
frame $K'_{(A)}$ (Fig. 23.1). To find these relations, let us use the symbol $\mathbf{a}$
to designate the position vector of the point A in the frame $K'_{(C)}$.
Hence, for any point of the body, $\mathbf{r}_{(A)} = \mathbf{r} - \mathbf{a}$ and, consequently,

$$x_i^{(A)} = x_i - a_i \tag{23.17}$$

where $a_i$ is the i-th coordinate of the point A in the frame $K'_{(C)}$.
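Let us introduce the values (23.17) into formula (23.16):

$$
\begin{aligned}
I_{ik}^{(A)} &= \sum_\alpha m_\alpha \Big\{ \delta_{ik}\Big(\sum_l (x_{l\alpha} - a_l)^2\Big) - (x_{i\alpha} - a_i)(x_{k\alpha} - a_k) \Big\} \\
 &= \sum_\alpha m_\alpha \Big\{ \delta_{ik}\Big(\sum_l x_{l\alpha}^2\Big) - x_{i\alpha} x_{k\alpha} \Big\} + \sum_\alpha m_\alpha \Big\{ \delta_{ik}\Big(\sum_l a_l^2\Big) - a_i a_k \Big\} \\
 &\quad - \sum_\alpha m_\alpha \delta_{ik} \sum_l 2 x_{l\alpha} a_l + \sum_\alpha m_\alpha x_{i\alpha} a_k + \sum_\alpha m_\alpha x_{k\alpha} a_i
\end{aligned}
$$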

The first of the five sums on the right-hand side is $I_{ik}$. In the second
sum, none of the quantities in the braces has the subscript $\alpha$. In
addition, $\sum_l a_l^2$ is the square of the vector $\mathbf{a}$, i.e. $a^2$. Therefore, the
second sum can be written as $m(a^2\delta_{ik} - a_i a_k)$. The third sum can be
written as $-2\delta_{ik}\sum_l a_l \sum_\alpha m_\alpha x_{l\alpha}$. But $\sum_\alpha m_\alpha x_{l\alpha} = m x_{lC} = 0$, so that
the third sum vanishes. Similarly, factoring out the multiplier in
the fourth and fifth sums that does not depend on $\alpha$ and taking into
account that $x_{iC} = x_{kC} = 0$, we find that both these sums also vanish.
We thus arrive at the relation

$$I_{ik}^{(A)} = I_{ik} + m(a^2\delta_{ik} - a_i a_k) \tag{23.18}$$

which is the tensor form of writing the parallel axis (Steiner) theorem.
To verify this, let us find the component $I_{zz}^{(A)} = I_{33}^{(A)}$. According to
(23.18)

$$I_{zz}^{(A)} = I_{zz} + m(a^2 - a_z^2) = I_{zz} + m(a_x^2 + a_y^2) = I_{zz} + m a_\perp^2 \tag{23.19}$$

where $a_\perp$ is the distance between the axes z and $z^{(A)}$.
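
The tensor form of the parallel axis theorem can be verified directly for a small set of particles. The sketch below computes the inertia tensor twice, about the centre of mass and about a displaced origin A, and compares the second result with the first one plus the correction m(a²δ_ik − a_i a_k). The particle data, the displacement a and the function names are assumptions made for illustration; the positions are chosen so that the centre of mass is exactly at the origin.

```python
def inertia_tensor(masses, positions):
    """I_ik = sum over particles of m * (delta_ik * r^2 - x_i * x_k)."""
    I = [[0.0] * 3 for _ in range(3)]
    for m, r in zip(masses, positions):
        r2 = sum(c * c for c in r)
        for i in range(3):
            for k in range(3):
                I[i][k] += m * ((r2 if i == k else 0.0) - r[i] * r[k])
    return I

def steiner_shift(I_C, total_mass, a):
    """I_C plus the correction m * (a^2 * delta_ik - a_i * a_k)."""
    a2 = sum(c * c for c in a)
    return [[I_C[i][k] + total_mass * ((a2 if i == k else 0.0) - a[i] * a[k])
             for k in range(3)] for i in range(3)]

# Assumed sample body with its centre of mass at the origin.
masses = [1.0, 1.0, 2.0]
positions = [(1.0, 0.0, 1.0), (-1.0, 2.0, 1.0), (0.0, -1.0, -1.0)]
a = (0.5, -1.0, 2.0)   # position of the point A relative to the centre of mass

I_C = inertia_tensor(masses, positions)

# Coordinates of the particles relative to A: x_i - a_i.
shifted = [tuple(x - ai for x, ai in zip(r, a)) for r in positions]
I_A_direct = inertia_tensor(masses, shifted)
I_A_steiner = steiner_shift(I_C, sum(masses), a)

max_error = max(abs(I_A_direct[i][k] - I_A_steiner[i][k])
                for i in range(3) for k in range(3))
print(max_error)
```

The zz component of the correction equals the total mass times the squared distance between the two parallel z-axes, which is the familiar elementary form of the theorem.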

We calculate the kinetic energy of a body rotating with the angular
velocity $\omega$ about a stationary (in the frame K) axis fixed relative
to the body and not passing through its centre of mass. If the centre
of mass is at a distance $a_\perp$ from the axis of rotation, its velocity is
$V_C = \omega a_\perp$. Consequently,

$$T_{\text{trans}} = \frac{1}{2} m\omega^2 a_\perp^2$$

where m is the mass of the body. We find the rotational energy by
formula (23.14), directing the z-axis parallel to the stationary axis of
rotation. Summating both energies, we obtain

$$T = \frac{1}{2} m\omega^2 a_\perp^2 + \frac{1}{2} I_{zz}\,\omega^2 = \frac{1}{2}\left(m a_\perp^2 + I_{zz}\right)\omega^2 = \frac{1}{2} I_{zz}^{(A)}\omega^2 \tag{23.20}$$

[see formula (23.19)].

Consequently, the formula for the kinetic energy of a rotating
body considered in the general course of physics is not always true,
but only in two cases: (1) when the body rotates about one of its
principal axes of inertia [see formula (23.13)], and (2) when the body
rotates about a stationary axis fixed in it [see formulas (23.14) and
(23.20)].

In concluding, we shall find the form of the inertia tensor when
only one of the coordinate axes, say the z-axis, coincides with one
of the principal axes of inertia of a body. The transition from a frame
all of whose axes coincide with the principal ones to the frame we
are interested in is achieved by rotation about the z-axis through the
angle $\varphi$. It is easy to see that the table of the transformation coefficients
is as follows in this case:

$$
\begin{pmatrix}
\alpha_{11} & \alpha_{12} & 0 \\
\alpha_{21} & \alpha_{22} & 0 \\
0 & 0 & 1
\end{pmatrix}
=
\begin{pmatrix}
\cos\varphi & \sin\varphi & 0 \\
-\sin\varphi & \cos\varphi & 0 \\
0 & 0 & 1
\end{pmatrix}
$$

The components of the inertia tensor in the reference frame we
are interested in are obtained from the components of the tensor
reduced to the principal axes according to the transformation
formula (X.10). Let us represent the components of the tensor (23.10)
in the form $I_{lm} = \delta_{lm} I_m$. Therefore, denoting the components of
the tensor in the new “primed” frame simply by $I_{ik}$ (without a
prime), we can write

$$I_{ik} = \sum_{l,m} \alpha_{il}\alpha_{km}\delta_{lm} I_m = \sum_m \alpha_{im}\alpha_{km} I_m$$

According to this formula

$$I_{11} = \sum_m \alpha_{1m}^2 I_m = \alpha_{11}^2 I_1 + \alpha_{12}^2 I_2 = \cos^2\varphi\, I_1 + \sin^2\varphi\, I_2$$

Similar calculations show that $I_{22} = \sin^2\varphi\, I_1 + \cos^2\varphi\, I_2$, $I_{33} = I_3$,
$I_{12} = I_{21} = \sin\varphi\cos\varphi\,(I_2 - I_1)$, $I_{13} = I_{31} = 0$, $I_{23} = I_{32} = 0$.
Hence, the inertia tensor in the new frame has the form

$$(I_{ik}) = \begin{pmatrix} I_{11} & I_{12} & 0 \\ I_{21} & I_{22} & 0 \\ 0 & 0 & I_3 \end{pmatrix} \tag{23.21}$$

We must note that when $I_1 = I_2 = I$, it follows from the
formulas we have obtained that $I_{11} = I_{22} = I$, and $I_{12} = I_{21} = 0$,
i.e. that the new tensor does not differ from the initial one. This is
exactly how matters should stand because when the moments $I_1$
and $I_2$ are equal, the principal axes $x$ and $y$ are not fixed.
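
The transformation just described is easy to reproduce numerically. The following Python sketch (an illustration under assumed values of the principal moments and of the angle; the function name is hypothetical) builds the tensor from $I_{ik} = \sum_m \alpha_{im}\alpha_{km} I_m$ and compares it with the closed-form components quoted above.

```python
import math

def rotated_inertia_tensor(I1, I2, I3, phi):
    """I_ik = sum_m alpha_im * alpha_km * I_m for a rotation about z through phi."""
    c, s = math.cos(phi), math.sin(phi)
    alpha = [[c, s, 0.0],
             [-s, c, 0.0],
             [0.0, 0.0, 1.0]]
    principal = (I1, I2, I3)
    return [[sum(alpha[i][m] * alpha[k][m] * principal[m] for m in range(3))
             for k in range(3)] for i in range(3)]

# Assumed illustrative principal moments and rotation angle
I1, I2, I3, phi = 2.0, 5.0, 7.0, 0.4
I = rotated_inertia_tensor(I1, I2, I3, phi)

c, s = math.cos(phi), math.sin(phi)
expected_11 = c * c * I1 + s * s * I2
expected_22 = s * s * I1 + c * c * I2
expected_12 = s * c * (I2 - I1)

# With I1 == I2 the off-diagonal components vanish for any angle
I_sym = rotated_inertia_tensor(3.0, 3.0, 7.0, phi)

for row in I:
    print(row)
```

With equal moments $I_1 = I_2$, the sketch returns the initial diagonal tensor for any $\varphi$, in agreement with the remark above.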

24. Angular Momentum of a Rigid Body 

As in the preceding sections, we shall consider the motion of a
rigid body in the reference frame $K$ with the axes $X_1, X_2, X_3$. We
shall rigidly associate with the body the frame $K'$ whose origin will
first be placed at an arbitrary point $A$. We shall designate the axes
of this frame by $x_1, x_2, x_3$ (or $x, y, z$). We shall mentally divide the
body into particles of mass $m_\alpha$.

By formula (21.8), the velocity of the $\alpha$-th particle in the frame
$K$ is

$$\mathbf{V}_\alpha = \mathbf{V}_A + [\boldsymbol{\omega}, \mathbf{r}_\alpha^{(A)}] \tag{24.1}$$

where $\mathbf{V}_A$ is the velocity of the origin of the frame $K'$, $\boldsymbol{\omega}$ is the angular
velocity of rotation of the body in the frame $K$, and $\mathbf{r}_\alpha^{(A)}$ is the
position vector of the particle emerging from the point $A$.

Let us find the angular momentum $\mathbf{M}^{(A)}$ of the body relative to
the origin of the frame $K'$ (relative to the point $A$). The position
vector leading from the point $A$ to the $\alpha$-th particle is $\mathbf{r}_\alpha^{(A)}$. Hence,

$$\mathbf{M}^{(A)} = \sum_\alpha [\mathbf{r}_\alpha^{(A)}, m_\alpha \mathbf{V}_\alpha]$$

Substitution into this expression of the value (24.1) for $\mathbf{V}_\alpha$ yields

$$\mathbf{M}^{(A)} = \sum_\alpha [\mathbf{r}_\alpha^{(A)}, m_\alpha \mathbf{V}_A] + \sum_\alpha [\mathbf{r}_\alpha^{(A)}, m_\alpha [\boldsymbol{\omega}, \mathbf{r}_\alpha^{(A)}]] \tag{24.2}$$

The second term in (24.2) is the value $\mathbf{M}'^{(A)}$ which the angular momentum
would have if the point $A$ were stationary. Consequently,
$\mathbf{M}'^{(A)}$ is the angular momentum due only to rotation of
the body.

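Let us transform the first term in (24.2) using the distributivity of
a vector product:

$$\sum_\alpha [\mathbf{r}_\alpha^{(A)}, m_\alpha \mathbf{V}_A] = \sum_\alpha [m_\alpha \mathbf{r}_\alpha^{(A)}, \mathbf{V}_A] = \Big[\Big(\sum_\alpha m_\alpha \mathbf{r}_\alpha^{(A)}\Big), \mathbf{V}_A\Big] = [m\mathbf{r}_C^{(A)}, \mathbf{V}_A] = [\mathbf{r}_C^{(A)}, m\mathbf{V}_A]$$

Here $m$ is the mass of the body, and $\mathbf{r}_C^{(A)}$ is the position vector of the
centre of mass in the frame $K'$ (the position vector from $A$ to $C$).
Expression (24.2) can thus be given the form

$$\mathbf{M}^{(A)} = \mathbf{M}'^{(A)} + [\mathbf{r}_C^{(A)}, m\mathbf{V}_A] \tag{24.3}$$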

We have already noted that the term $\mathbf{M}'^{(A)}$ is due to rotation of the
body. It can be called the proper angular momentum of the body.
The second term is due to the translational motion of the body.

If we place the origin of the frame $K'$ (i.e. the point $A$) at the
centre of mass of the body, $\mathbf{r}_C^{(A)}$ vanishes, and as a result (24.3)
becomes $\mathbf{M}^{(C)} = \mathbf{M}'^{(C)}$. It thus follows that the angular momentum of
a body relative to its centre of mass observed in the stationary frame
$K$ coincides with its proper angular momentum, i.e. it is determined
only by rotation and does not depend on whether the centre of
mass of the body is moving or is at rest.

Let us find an expression for the angular momentum¹ of a body
relative to its centre of mass. If the point $A$ coincides with $C$, the
first term in (24.2) vanishes. Consequently,

$$\mathbf{M}^{(C)} = \sum_\alpha m_\alpha [\mathbf{r}_\alpha, [\boldsymbol{\omega}, \mathbf{r}_\alpha]] \tag{24.4}$$

(recall that $\mathbf{r}_\alpha$ is the position vector of a particle emerging from
point $C$). Let us transform this expression according to formula
(VI.5):

$$\mathbf{M}^{(C)} = \sum_\alpha m_\alpha \{\boldsymbol{\omega} r_\alpha^2 - \mathbf{r}_\alpha (\boldsymbol{\omega}\mathbf{r}_\alpha)\} = \sum_\alpha m_\alpha \Big\{\boldsymbol{\omega}\Big(\sum_l x_{l\alpha}^2\Big) - \mathbf{r}_\alpha \Big(\sum_k \omega_k x_{k\alpha}\Big)\Big\}$$

(we have expressed the scalar products in terms of projections of the
relevant vectors onto the axes of the frame $K'$ associated with the
body).

¹ Observed in the frame $K$. In the frame $K'$, the body is at rest, so that the
angular momentum in this frame is always zero.

Let us calculate the components of the vector $\mathbf{M}^{(C)}$ along the axes
of the frame $K'$ (in the following formulas we shall omit the index
“C” in parentheses). For the component along the $i$-th axis,
we obtain

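$$M_i = \sum_\alpha m_\alpha \Big\{\omega_i \Big(\sum_l x_{l\alpha}^2\Big) - x_{i\alpha}\Big(\sum_k \omega_k x_{k\alpha}\Big)\Big\}$$

Let us write $\omega_i$ in the form

$$\omega_i = \sum_k \delta_{ik}\omega_k$$

Hence

$$M_i = \sum_\alpha m_\alpha \Big\{\sum_k \delta_{ik}\omega_k \Big(\sum_l x_{l\alpha}^2\Big) - x_{i\alpha}\sum_k \omega_k x_{k\alpha}\Big\}$$

Finally, let us change the sequence of summation over the subscripts
$\alpha$ and $k$:

$$M_i = \sum_k \omega_k \Big[\sum_\alpha m_\alpha \Big\{\delta_{ik}\Big(\sum_l x_{l\alpha}^2\Big) - x_{i\alpha}x_{k\alpha}\Big\}\Big]$$

The expression in brackets is a component of the inertia tensor $I_{ik}$
[see formula (23.6)]. Consequently, for the projection of the vector $\mathbf{M}$
onto the $i$-th axis of the reference frame associated with the body,
we get the following expression

$$M_i = \sum_k I_{ik}\omega_k \qquad (i = 1, 2, 3) \tag{24.5}$$

(do not forget that $\omega_k$ is the projection of the vector $\boldsymbol{\omega}$ onto the
$k$-th axis of the frame $K'$).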

Examination of formula (24.5) shows that the vectors $\mathbf{M}$ and $\boldsymbol{\omega}$
are, in general, not collinear to each other. If the axes of the frame
$K'$ (i.e. the axes $x, y, z$) are directed along the principal axes of
inertia of a body, formula (24.5) becomes simplified as follows:

$$M_i = I_i\omega_i \qquad (i = x, y, z) \tag{24.6}$$

Here $I_i$ is the $i$-th principal moment of inertia.

Assume that a body rotates, for example, about the third principal
axis of inertia. Therefore, $\omega_x = \omega_y = 0$, and $\omega_z = \omega$, so that

$$M = M_z = I_z\omega_z = I_z\omega$$

The last relation can be written in the vector form:

$$\mathbf{M} = I_z\boldsymbol{\omega} \tag{24.7}$$

A glance at formula (24.6) shows that for a spherical top (i.e. a
body for which $I_1 = I_2 = I_3 = I$), the vectors $\mathbf{M}$ and $\boldsymbol{\omega}$ will also
be collinear, and $\mathbf{M} = I\boldsymbol{\omega}$.
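
The equivalence of the definition (24.4) and the tensor form (24.5) can be illustrated numerically. The sketch below is a minimal check for an assumed set of point masses and an assumed angular velocity (all values arbitrary); it also shows that for a generic body $\mathbf{M}$ is not collinear with $\boldsymbol{\omega}$.

```python
def cross(u, v):
    """Vector product [u, v] of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

# Arbitrary illustrative body and angular velocity (components along the body axes)
masses = [1.0, 2.0, 3.0, 4.0]
points = [(1.0, 0.0, 0.5), (-0.5, 1.5, 0.0), (0.0, -1.0, 2.0), (0.3, 0.2, -1.0)]
omega = (0.3, -1.2, 0.8)

# Position vectors r_a measured from the centre of mass C
m = sum(masses)
C = [sum(m_a * p[i] for m_a, p in zip(masses, points)) / m for i in range(3)]
r = [tuple(p[i] - C[i] for i in range(3)) for p in points]

# Definition (24.4): M = sum_a m_a [r_a, [omega, r_a]]
M_direct = [0.0, 0.0, 0.0]
for m_a, r_a in zip(masses, r):
    term = cross(r_a, cross(omega, r_a))
    for i in range(3):
        M_direct[i] += m_a * term[i]

# Tensor form (24.5): M_i = sum_k I_ik omega_k
I = [[sum(m_a * ((sum(c * c for c in r_a) if i == k else 0.0) - r_a[i] * r_a[k])
          for m_a, r_a in zip(masses, r))
      for k in range(3)] for i in range(3)]
M_tensor = [sum(I[i][k] * omega[k] for k in range(3)) for i in range(3)]

# [M, omega] is nonzero when M and omega are not collinear
misalignment = cross(M_tensor, omega)
print(M_direct, M_tensor, misalignment)
```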

If a body rotates about the $z$-axis fixed in it that does not coincide
with any of the principal axes of inertia, $\omega_x = \omega_y = 0$, and $\omega_z = \omega$.
Formula (24.5) therefore yields

$$M_x = I_{xz}\omega, \qquad M_y = I_{yz}\omega, \qquad M_z = I_{zz}\omega$$

The projection of the vector $\mathbf{M}$ onto an axis passing through the
point relative to which $\mathbf{M}$ has been determined is called the angular
momentum of the body relative to this axis. Consequently, the angular
momentum relative, for example, to the $z$-axis is

$$M_z = I_{zz}\omega \tag{24.8}$$

We must stress that the vector $\mathbf{M}$ itself in the last case is not
collinear to the vector $\boldsymbol{\omega}$ and rotates about the direction of $\boldsymbol{\omega}$
together with the axes $x$ and $y$.

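In conclusion, let us consider the case when a body rotates about a
stationary (in the frame $K$) axis fixed in it that does not pass through
its centre of mass $C$ (Fig. 24.1). Let us take an arbitrary point $A$ on
the axis of rotation. We draw a vector, which we shall designate by the
symbol $\mathbf{a}$, from this point to the point $C$. Therefore, for each of the
body’s particles

$$\mathbf{r}_\alpha^{(A)} = \mathbf{a} + \mathbf{r}_\alpha \tag{24.9}$$

where $\mathbf{r}_\alpha^{(A)}$ is the position vector of the $\alpha$-th particle emerging from
the point $A$, and $\mathbf{r}_\alpha$ is the position vector of the same particle from
the point $C$.

According to our assumption, the axis of rotation is stationary.
Consequently, the velocity $\mathbf{V}_A$ equals zero. Let us find the angular
momentum of the body relative to the point $A$, i.e. $\mathbf{M}^{(A)}$. By
formula (24.2)

$$\mathbf{M}^{(A)} = \sum_\alpha [\mathbf{r}_\alpha^{(A)}, m_\alpha [\boldsymbol{\omega}, \mathbf{r}_\alpha^{(A)}]]$$

We introduce into this expression the value of $\mathbf{r}_\alpha^{(A)}$ from (24.9):

$$\mathbf{M}^{(A)} = \sum_\alpha [(\mathbf{a} + \mathbf{r}_\alpha), m_\alpha [\boldsymbol{\omega}, (\mathbf{a} + \mathbf{r}_\alpha)]] \tag{24.10}$$

Taking advantage of the distributivity of a vector product, after
simple transformations we can write expression (24.10) as follows:

$$\mathbf{M}^{(A)} = [\mathbf{a}, [\boldsymbol{\omega}\mathbf{a}]]\Big(\sum_\alpha m_\alpha\Big) + \Big[\mathbf{a}, \Big[\boldsymbol{\omega}, \Big(\sum_\alpha m_\alpha\mathbf{r}_\alpha\Big)\Big]\Big] + \Big[\Big(\sum_\alpha m_\alpha\mathbf{r}_\alpha\Big), [\boldsymbol{\omega}\mathbf{a}]\Big] + \sum_\alpha m_\alpha[\mathbf{r}_\alpha, [\boldsymbol{\omega}\mathbf{r}_\alpha]] \tag{24.11}$$

Since $\sum_\alpha m_\alpha\mathbf{r}_\alpha = m\mathbf{r}_C = 0$, the second and third terms in the
expression we have obtained vanish. Using formula (VI.5) and substituting
the mass $m$ of the body for $\sum_\alpha m_\alpha$, we transform the first term into

$$m[\boldsymbol{\omega}a^2 - \mathbf{a}(\boldsymbol{\omega}\mathbf{a})] = m\{(a_\parallel^2 + a_\perp^2)\boldsymbol{\omega} - a_\parallel\omega\mathbf{a}\},$$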

where $\mathbf{a}_\parallel$ is the component of the vector $\mathbf{a}$ parallel to $\boldsymbol{\omega}$, and $\mathbf{a}_\perp$
is the component of $\mathbf{a}$ perpendicular to $\boldsymbol{\omega}$ ($a_\perp$ is the distance between
the axis of rotation and the $z$-axis parallel to it and passing through
the point $C$).

Finally, a comparison with formula (24.4) shows that the last
term in (24.11) is $\mathbf{M}^{(C)}$, the angular momentum of the body relative
to the point $C$. Hence, in the case shown in Fig. 24.1

$$\mathbf{M}^{(A)} = m(a_\parallel^2 + a_\perp^2)\boldsymbol{\omega} - m a_\parallel\omega\mathbf{a} + \mathbf{M}^{(C)} \tag{24.12}$$

Let us find the component of the angular momentum (24.12) along
the axis of rotation. We shall designate it $\mathbf{M}_\parallel^{(A)}$. The component of
the first term equals this term itself. Since $\boldsymbol{\omega}a_\parallel = \mathbf{a}_\parallel\omega$, the
component of the second term can be written as $-m a_\parallel^2\boldsymbol{\omega}$. By formula (24.8),
the component of the third term is $I_{zz}\boldsymbol{\omega}$, where $I_{zz}$ is the moment
of inertia of the body relative to the $z$-axis (passing through the
point $C$). Consequently,

$$\mathbf{M}_\parallel^{(A)} = m a_\perp^2\boldsymbol{\omega} + I_{zz}\boldsymbol{\omega} = (m a_\perp^2 + I_{zz})\,\boldsymbol{\omega} = I_{zz}^{(A)}\boldsymbol{\omega} \tag{24.13}$$

Here $I_{zz}^{(A)}$ is the moment of inertia of the body relative to a fixed
axis of rotation passing through the point $A$ determined by the
parallel axis theorem [see formula (23.19)].

We must note that the vector $\mathbf{M}^{(A)}$ itself, in general, does not
coincide in direction with the vector $\boldsymbol{\omega}$.
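
Relations (24.12) and (24.13) lend themselves to a direct numerical check. The following Python sketch assumes an arbitrary set of point masses, an axis of rotation parallel to $z$, and an arbitrary vector $\mathbf{a}$ from $A$ to $C$; none of these values comes from the text.

```python
def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    return sum(u[i] * v[i] for i in range(3))

# Arbitrary illustrative values
masses = [1.0, 2.0, 3.0, 4.0]
points = [(1.0, 0.0, 0.5), (-0.5, 1.5, 0.0), (0.0, -1.0, 2.0), (0.3, 0.2, -1.0)]
omega_mag = 2.0
omega = (0.0, 0.0, omega_mag)      # rotation about an axis parallel to z
a = (0.6, -0.3, 0.4)               # vector from the point A on the axis to C

m = sum(masses)
C = [sum(m_a * p[i] for m_a, p in zip(masses, points)) / m for i in range(3)]
r = [tuple(p[i] - C[i] for i in range(3)) for p in points]     # from C
r_A = [tuple(a[i] + r_a[i] for i in range(3)) for r_a in r]    # Eq. (24.9)

def rotational_momentum(position_vectors):
    """sum_a m_a [r_a, [omega, r_a]] for the given position vectors."""
    total = [0.0, 0.0, 0.0]
    for m_a, r_a in zip(masses, position_vectors):
        term = cross(r_a, cross(omega, r_a))
        for i in range(3):
            total[i] += m_a * term[i]
    return total

M_A = rotational_momentum(r_A)     # left-hand side of (24.12)
M_C = rotational_momentum(r)       # Eq. (24.4)

# Right-hand side of (24.12): m (a^2 omega - a (omega a)) + M_C
rhs_24_12 = [m * (dot(a, a) * omega[i] - a[i] * dot(omega, a)) + M_C[i]
             for i in range(3)]

# Eq. (24.13): component along the axis equals (m a_perp^2 + I_zz) omega
a_perp2 = a[0] ** 2 + a[1] ** 2
I_zz = sum(m_a * (r_a[0] ** 2 + r_a[1] ** 2) for m_a, r_a in zip(masses, r))
M_parallel = (m * a_perp2 + I_zz) * omega_mag

print(M_A, rhs_24_12, M_parallel)
```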

25. Free Axes of Rotation 

Free axes of rotation of a body are defined to be axes that retain
their position in space without the action of external forces on them.
We shall prove in the present section that only the principal axes of
inertia can be free axes of rotation.

Assume that a body rotates at the angular velocity $\boldsymbol{\omega}$ about a fixed
axis associated with it (Fig. 25.1). The following acceleration must
be imparted to each particle of the body:

$$\mathbf{w}_\alpha = -\omega^2\boldsymbol{\rho}_\alpha$$

where $\boldsymbol{\rho}_\alpha$ is the component of the position vector $\mathbf{r}_\alpha$ of the given
particle perpendicular to the axis of rotation ($\mathbf{r}_\alpha$ emerges from the
point $O$ on the axis of rotation¹).

Fig. 25.1.

To impart such an acceleration to a particle, the force

$$\mathbf{F}_\alpha = m_\alpha\mathbf{w}_\alpha = -m_\alpha\omega^2\boldsymbol{\rho}_\alpha \tag{25.1}$$

must be applied to it whose moment relative to the point $O$ is

$$\mathbf{N}_\alpha = [\mathbf{r}_\alpha, \mathbf{F}_\alpha] = -m_\alpha\omega^2[\mathbf{r}_\alpha, \boldsymbol{\rho}_\alpha] \tag{25.2}$$

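By adding all the forces (25.1), we get the resultant external
force that has to be applied to a body to ensure its rotation about the
axis being considered²:

$$\mathbf{F} = \sum_\alpha \mathbf{F}_\alpha = -\omega^2\sum_\alpha m_\alpha\boldsymbol{\rho}_\alpha \tag{25.3}$$

The resultant moment of the external forces must equal the sum
of the moments (25.2):

$$\mathbf{N} = \sum_\alpha \mathbf{N}_\alpha = -\omega^2\sum_\alpha m_\alpha[\mathbf{r}_\alpha, \boldsymbol{\rho}_\alpha] \tag{25.4}$$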

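Let us associate with a body a system of coordinates having its
origin at the point $O$ and its $z$-axis directed along the vector $\boldsymbol{\omega}$. The
components of the vector $\boldsymbol{\rho}_\alpha$ along the axes of such a system are
$x_\alpha, y_\alpha, 0$. Consequently,

$$[\mathbf{r}_\alpha, \boldsymbol{\rho}_\alpha] = \begin{vmatrix} \mathbf{e}_x & \mathbf{e}_y & \mathbf{e}_z \\ x_\alpha & y_\alpha & z_\alpha \\ x_\alpha & y_\alpha & 0 \end{vmatrix}$$

so that the components of the vector $[\mathbf{r}_\alpha, \boldsymbol{\rho}_\alpha]$ are

$$\begin{aligned}
[\mathbf{r}_\alpha, \boldsymbol{\rho}_\alpha]_{\text{pr.}\,x} &= y_\alpha\cdot 0 - z_\alpha y_\alpha = -y_\alpha z_\alpha \\
[\mathbf{r}_\alpha, \boldsymbol{\rho}_\alpha]_{\text{pr.}\,y} &= z_\alpha x_\alpha - x_\alpha\cdot 0 = x_\alpha z_\alpha \\
[\mathbf{r}_\alpha, \boldsymbol{\rho}_\alpha]_{\text{pr.}\,z} &= x_\alpha y_\alpha - y_\alpha x_\alpha = 0
\end{aligned} \tag{25.5}$$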

Now let us write the components of the resultant force $\mathbf{F}$ and the
resultant moment $\mathbf{N}$. By formula (25.3), we have

$$\begin{aligned}
F_x &= -\omega^2\sum_\alpha m_\alpha x_\alpha = -\omega^2 m x_C \\
F_y &= -\omega^2\sum_\alpha m_\alpha y_\alpha = -\omega^2 m y_C \\
F_z &= 0
\end{aligned}$$

where $x_C$ and $y_C$ are the coordinates of the body’s centre of mass.
If the axis of rotation passes through the centre of mass, these
coordinates will be zero, so that all the components of the force and,
consequently, the force $\mathbf{F}$ itself, will vanish.

¹ In accordance with the notation we have adopted, this vector ought to be
designated by the symbol $\mathbf{r}_\alpha^{(O)}$. But since in this section we shall not encounter
position vectors emerging from the point $C$, we shall suppress the index “(O)”
on the symbols of vectors and coordinates to simplify our notation.

² The forces $\mathbf{F}_\alpha$ include both external and internal forces, but it is general
knowledge that the resultant of the internal forces is zero.

By formulas (25.4) and (25.5), we obtain

$$\begin{aligned}
N_x &= -\omega^2\sum_\alpha m_\alpha(-y_\alpha z_\alpha) = -\omega^2 I_{yz} \\
N_y &= -\omega^2\sum_\alpha m_\alpha(x_\alpha z_\alpha) = \omega^2 I_{xz} \\
N_z &= -\omega^2\sum_\alpha m_\alpha\cdot 0 = 0
\end{aligned}$$

where $I_{yz}$ and $I_{xz}$ are the centrifugal moments of inertia of the body
[see formula (23.9)]. If the $z$-axis about which rotation occurs is one
of the principal axes of inertia, the centrifugal moments $I_{xz}$ and $I_{yz}$
are zero [see (23.21)] so that all the components of the resultant
moment of the forces and, consequently, the moment $\mathbf{N}$ itself vanish.

We have thus proved that when a body rotates about one of its 
principal axes of inertia, the resultant of the external forces and the 
resultant moment of these forces equal zero. Hence, for such an 
axis of rotation to retain its position in space, no external forces 
are needed. 
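
The result of this section can be illustrated with the simplest possible body, a dumbbell of two equal point masses whose centre of mass lies at the point $O$. The Python sketch below (illustrative values; the function name is hypothetical) evaluates the sums (25.3) and (25.4) for rotation about the $z$-axis, once with the dumbbell lying along the $x$-axis, so that $z$ is a principal axis, and once with the dumbbell tilted in the $xz$ plane.

```python
def required_force_and_moment(masses, points, omega):
    """Resultant force (25.3) and moment (25.4) about O for rotation about z."""
    F = [0.0, 0.0, 0.0]
    N = [0.0, 0.0, 0.0]
    for m_a, (x, y, z) in zip(masses, points):
        rho = (x, y, 0.0)                    # component of r perpendicular to z
        r_cross_rho = (-y * z, x * z, 0.0)   # components (25.5) of [r, rho]
        for i in range(3):
            F[i] += -omega ** 2 * m_a * rho[i]
            N[i] += -omega ** 2 * m_a * r_cross_rho[i]
    return F, N

omega = 3.0
masses = [2.0, 2.0]

# Dumbbell along the x-axis: the z-axis is a principal axis of inertia
aligned = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0)]
# Dumbbell tilted in the xz plane: the z-axis is not a principal axis
tilted = [(1.0, 0.0, 0.5), (-1.0, 0.0, -0.5)]

F_aligned, N_aligned = required_force_and_moment(masses, aligned, omega)
F_tilted, N_tilted = required_force_and_moment(masses, tilted, omega)

print("aligned:", F_aligned, N_aligned)
print("tilted: ", F_tilted, N_tilted)
```

In both cases the resultant force vanishes because the axis passes through the centre of mass, but only the aligned dumbbell requires no moment; the tilted one needs a moment about the $y$-axis to keep its axis of rotation fixed.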


26. Equation of Motion of a Rigid Body 

Let us take as the generalized coordinates determining the position
of a body in the stationary frame $K$ the Cartesian coordinates
$X_{1C}, X_{2C}, X_{3C}$ of the centre of mass (the position vector $\mathbf{R}_C$
corresponds to them) and the Euler angles $\varphi, \vartheta, \psi$, and let us direct the
axes of the frame $K'$ associated with the body along its principal
axes of inertia.

We established in Sec. 23 that the kinetic energy of a rigid body
consists of the energy of translational motion (23.2) and that of
rotation, which with our choice of the axes of the frame $K'$ is
determined by formula (23.11). Hence, for the Lagrangian of a rigid
body, we can write the following expression:

$$L = \frac{1}{2} m V_C^2 + \frac{1}{2}(I_1\omega_1^2 + I_2\omega_2^2 + I_3\omega_3^2) - U \tag{26.1}$$

To obtain an expression for $L$ in the generalized coordinates we
have adopted, let us substitute $\dot{\mathbf{R}}_C$ for $\mathbf{V}_C$ and express the projections
of the vector $\boldsymbol{\omega}$ onto the axes of the frame $K'$ in terms of the Euler
angles [see (22.4)]. The result is

$$L = \frac{1}{2} m\dot{\mathbf{R}}_C^2 + \frac{1}{2}\{I_1(\dot\varphi\sin\vartheta\sin\psi + \dot\vartheta\cos\psi)^2 + I_2(\dot\varphi\sin\vartheta\cos\psi - \dot\vartheta\sin\psi)^2 + I_3(\dot\varphi\cos\vartheta + \dot\psi)^2\} - U(\mathbf{R}_C, \varphi, \vartheta, \psi) \tag{26.2}$$

(remember that $I_1, I_2, I_3$ are the principal moments of inertia of the
body).
Knowing the form of the function $U(\mathbf{R}_C, \varphi, \vartheta, \psi)$, we can compile
Lagrange’s equations and solve the relevant problem on the motion
of a rigid body. Lagrange’s equation corresponding to the coordinates
of the centre of mass has the form

$$\frac{d}{dt}(m\dot{\mathbf{R}}_C) = -\frac{\partial U}{\partial\mathbf{R}_C} = -\nabla U = \mathbf{F}$$

whence we get the equation of motion of the centre of mass of the
body:

$$m\ddot{\mathbf{R}}_C = \mathbf{F} \tag{26.3}$$

where $\mathbf{F}$ is the resultant of the external forces acting on the body.

To obtain an equation determining the variation of the angular
momentum $\mathbf{M}$ of a body with time, let us remember that for an
individual particle

$$\dot{\mathbf{M}}_\alpha = [\mathbf{r}_\alpha, \mathbf{F}_\alpha] = \mathbf{N}_\alpha$$

i.e. the time derivative of the angular momentum equals the moment
of the force acting on the particle. Summation over all the particles
of a body yields

$$\dot{\mathbf{M}} = \sum_\alpha \dot{\mathbf{M}}_\alpha = \sum_\alpha [\mathbf{r}_\alpha, \mathbf{F}_\alpha] = \mathbf{N} \tag{26.4}$$

where $\mathbf{N}$ is the sum of the moments of all the external forces acting
on the body relative to the point $C$ (the sum of the moments of the
internal forces is zero).

Let us write Eq. (26.4) in projections onto the axes of the frame $K'$
(onto the principal axes of inertia of a body):

$$\frac{d}{dt} M_i = N_i \qquad (i = 1, 2, 3) \tag{26.5}$$

The projection $N_i$ can be written as

$$N_i = -\frac{\delta U}{\delta\Phi_i} = -\frac{\partial U}{\partial\Phi_i} = \frac{\partial L}{\partial\Phi_i} \tag{26.6}$$

where $\delta\Phi_i$ is the angle of rotation of the body about its $i$-th principal
axis ($\omega_i = d\Phi_i/dt$). Indeed, when the body rotates through the angle
$\delta\Phi_i$, the forces applied to it do the work

$$\delta A = \sum \mathbf{F}\,\delta\mathbf{R} = \sum \mathbf{F}\,[(\delta\Phi_i\mathbf{e}_i), \mathbf{r}]$$

Here summation is performed not over the subscript $i$, but over all
the external forces acting on the body. $\delta\mathbf{R}$ is the displacement of the
point of application of the relevant force, $\mathbf{e}_i$ is the unit vector of the
$i$-th principal axis, and $\mathbf{r}$ is the position vector of the point of
application of the force emerging from the point $C$ ($\mathbf{R}$ extends from the
origin of the frame $K$). After a cyclic transposition of the factors
and putting the common factor $\delta\Phi_i\mathbf{e}_i$ outside the sum sign, we
arrive at the expression

$$\delta A = \delta\Phi_i\,\mathbf{e}_i\sum[\mathbf{r}\mathbf{F}] = \delta\Phi_i\,\mathbf{e}_i\mathbf{N} = N_i\,\delta\Phi_i$$

where $N_i$ is the projection of the resultant moment of the forces
onto the axis about which rotation through the angle $\delta\Phi_i$ has
occurred. The work $\delta A$ we have calculated is done at the expense of
the decrement of the potential energy $U$, i.e.

$$\delta A = N_i\,\delta\Phi_i = -\delta U$$

whence follows formula (26.6).

We have obtained formula (26.6) in considering rotation about
one of the principal axes of inertia of a body. This formula also
holds in the most general case, for rotation about an arbitrary
axis (naturally, provided that the force whose moment is being
considered is a potential one).

A glance at formula (26.6) shows that the quantities $N_i$ are
generalized forces corresponding to the generalized coordinates $\Phi_i$ [see
(4.20)].

Let us now differentiate function (26.1) with respect to $\omega_i$. The
result is

$$\frac{\partial L}{\partial\omega_i} = I_i\omega_i = M_i$$

[see formula (24.6)]. The projection of the angular velocity $\omega_i$ can
be represented as $\dot\Phi_i$ [see the text following formula (26.6)]. We
can therefore write that

$$M_i = \frac{\partial L}{\partial\omega_i} = \frac{\partial L}{\partial\dot\Phi_i} \tag{26.7}$$

[compare with formula (11.3)]. It follows from formula (26.7) that
the quantities $M_i$ are generalized momenta corresponding to the
generalized coordinates $\Phi_i$ [see (4.19)].

Using relations (26.6) and (26.7), we can represent formula (26.5)
as follows:

$$\frac{d}{dt}\frac{\partial L}{\partial\dot\Phi_i} = \frac{\partial L}{\partial\Phi_i} \tag{26.8}$$

i.e. as Lagrange’s equation corresponding to the generalized
coordinate $\Phi_i$.

It should be borne in mind that with our selection of the axes of
the frame $K'$ only the Euler angle $\psi$ corresponds to rotation about a
principal axis. The other angles correspond to rotations about the
fixed axis $X_3$ and about the nodal line.

By differentiating function (26.2) with respect to $\dot\psi$, we shall find
an expression for the projection of the angular momentum onto the
axis $x_3$ (onto the $z$-axis) in terms of the Euler angles:

$$M_\psi = M_3 = I_3(\dot\varphi\cos\vartheta + \dot\psi) \tag{26.9}$$

Assume that the Euler angle $\vartheta$ equals zero. This signifies that the
$Z$- and $z$-axes coincide all the time: the body rotates about an axis
associated with it that is fixed in the frame $K$. In this case, the sum
of the angles $\varphi + \psi$ determines the total angle of rotation of the
body about the $z$-axis. The position of the nodal line in this case is
indefinite: it may be located at any place between the $X$- and
$x$-axes. Particularly, the nodal line can be made to coincide with the
$X$-axis. Now $\varphi = 0$, and rotation of the body about the $z$-axis will
be characterized by the angle of proper rotation $\psi$. If we make the
nodal line coincide with the $x$-axis, the angle $\psi$ will vanish, and
rotation about the $z$-axis will be described by the precession angle $\varphi$.

When $\vartheta = 0$, formula (26.9) becomes

$$M_3 = I_3(\dot\varphi + \dot\psi) = I_3\omega_3$$

where $\omega_3$ is the angular velocity of rotation of the body about the
$z$-axis.

When $\vartheta = \pi/2$, formula (26.9) is simplified as follows:

$$M_\psi = M_3 = I_3\dot\psi = I_3\omega_\psi$$

where $\omega_\psi$ is the angular velocity of proper rotation of the body.

In conclusion, we shall compile Lagrange’s equation for a body
rotating about an axis rigidly associated with it that is fixed in the
frame $K$. To be more general, we shall consider that the axis of
rotation does not pass through the centre of mass $C$ of the body and
is not parallel to any of its principal axes of inertia.

We direct the axes $Z$ and $z$ along the axis of rotation of the body.
Hence, the Euler angle $\vartheta$ will be zero. Since the nodal line is not
fixed in this case, we shall make it coincide with the $x$-axis, and the
angle $\psi$ will also vanish. With our choice of the $z$-axis (along the
direction of the vector $\boldsymbol{\omega}$), we have $\omega_x = \omega_y = 0$, and $\omega_z = \omega = \dot\varphi$.

We have calculated the kinetic energy for such a case on an earlier
page. It was equal to the value given by (23.20). Consequently, the
Lagrangian has the form

$$L = \frac{1}{2} I_{zz}^{(A)}\omega^2 - U(\varphi) = \frac{1}{2} I_{zz}^{(A)}\dot\varphi^2 - U(\varphi) \tag{26.10}$$

($A$ is a point not coinciding with $C$ at which the origin of the frame
$K'$ is).

We compile Lagrange’s equation:

$$\frac{d}{dt}\frac{\partial L}{\partial\dot\varphi} = \frac{\partial L}{\partial\varphi}$$

It follows from (26.10) that the left-hand side equals $I_{zz}^{(A)}\dot\omega$. By
(26.6), the right-hand side, equal to $-\partial U/\partial\varphi$, is the projection of the
resultant moment of the forces onto the axis of rotation, i.e. $N_z$.
We thus arrive at the equation

$$I_{zz}^{(A)}\dot\omega = N_z$$

27. Euler's Equations 


Lagrange’s equations corresponding to the Euler angles (i.e.
describing the rotation of a body), as can readily be concluded from
the form of the function (26.2), are very complicated. It is sometimes
more convenient to use other equations that were obtained by L. Euler
and bear his name. To arrive at these equations, we shall proceed from
relation (26.4):

$$\frac{d\mathbf{M}}{dt} = \mathbf{N} \tag{27.1}$$

Equation (27.1) holds in an inertial reference frame (i.e. in the
stationary frame $K$). We shall try to find an equation that holds in
the frame $K'$ rotating together with the body whose axes coincide
with the principal axes of inertia of the body.

In Eq. (27.1), $d\mathbf{M}$ is the increment of the vector $\mathbf{M}$ during the time
$dt$, observed in the frame $K$. By (15.6), this increment can be written
as

$$d\mathbf{M} = d'\mathbf{M} + [d\boldsymbol{\varphi}, \mathbf{M}]$$

where $d'\mathbf{M}$ is the increment of the vector $\mathbf{M}$ during the time $dt$,
observed in the frame $K'$, and $d\boldsymbol{\varphi}$ is the angle through which the frame
$K'$ rotates during the time $dt$. Dividing the last equation by $dt$, we
obtain

$$\frac{d\mathbf{M}}{dt} = \frac{d'\mathbf{M}}{dt} + \Big[\frac{d\boldsymbol{\varphi}}{dt}, \mathbf{M}\Big] = \frac{d'\mathbf{M}}{dt} + [\boldsymbol{\omega}, \mathbf{M}] \tag{27.2}$$

where $d\mathbf{M}/dt$ is the rate of the change in the vector $\mathbf{M}$ observed in the
frame $K$, $d'\mathbf{M}/dt$ is the rate of change in the same vector observed in
the frame $K'$, and $\boldsymbol{\omega}$ is the angular velocity of rotation of the frame
$K'$ (i.e. the angular velocity of rotation of the body).

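Formula (27.2) holds for any vector; particularly, it also holds
for the vector $\boldsymbol{\omega}$. In the latter case, we have

$$\frac{d\boldsymbol{\omega}}{dt} = \frac{d'\boldsymbol{\omega}}{dt} + [\boldsymbol{\omega}\boldsymbol{\omega}]$$

Since $[\boldsymbol{\omega}\boldsymbol{\omega}] = 0$, we arrive at the relation

$$\frac{d\boldsymbol{\omega}}{dt} = \frac{d'\boldsymbol{\omega}}{dt} \tag{27.3}$$

from which it follows that the rates of the change in the vector $\boldsymbol{\omega}$
observed in the frames $K$ and $K'$ are identical.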

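Let us substitute expression (27.2) for the left-hand side of
formula (27.1). The result is the equation

$$\frac{d'\mathbf{M}}{dt} + [\boldsymbol{\omega}\mathbf{M}] = \mathbf{N}$$

Let us project all the vectors onto the $i$-th axis of the frame $K'$,
taking into account that the projection of the vector $\mathbf{M}$ onto this
axis is $M_i = I_i\omega_i$ [see (24.6)]:

$$\frac{d'(I_i\omega_i)}{dt} + [\boldsymbol{\omega}\mathbf{M}]_{\text{pr.}\,x_i} = N_i$$

Knowing that $I_i = \text{const}$ and $d'\boldsymbol{\omega}/dt = d\boldsymbol{\omega}/dt$ [see (27.3)], we can
write the equation obtained as follows:

$$I_i\frac{d\omega_i}{dt} + [\boldsymbol{\omega}\mathbf{M}]_{\text{pr.}\,x_i} = N_i \qquad (i = 1, 2, 3) \tag{27.4}$$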

Let us represent the projection of the vector product onto the
axis $x_i$ by formula (VI.33):

$$[\boldsymbol{\omega}\mathbf{M}]_{\text{pr.}\,x_i} = \sum_{k,l}\varepsilon_{ikl}\,\omega_k M_l = \sum_{k,l}\varepsilon_{ikl}\,I_l\,\omega_k\omega_l$$

and introduce this expression into (27.4). As a result, we arrive at
the equations

$$I_i\frac{d\omega_i}{dt} + \sum_{k,l}\varepsilon_{ikl}\,I_l\,\omega_k\omega_l = N_i \qquad (i = 1, 2, 3) \tag{27.5}$$

which are Euler’s equations. Assuming consecutively that $i = 1$,
$i = 2$, $i = 3$, and summing over $k$ and $l$, we obtain three equations:

$$\begin{aligned}
I_1\frac{d\omega_1}{dt} + (I_3 - I_2)\,\omega_2\omega_3 &= N_1 \\
I_2\frac{d\omega_2}{dt} + (I_1 - I_3)\,\omega_3\omega_1 &= N_2 \\
I_3\frac{d\omega_3}{dt} + (I_2 - I_1)\,\omega_1\omega_2 &= N_3
\end{aligned} \tag{27.6}$$

We must note that each following equation is obtained from the
preceding one by means of a cyclic transposition of the subscripts
1, 2, 3.

It is easy to see that for a spherical top, Eqs. (27.6) transform
into the equation $I\dot{\boldsymbol{\omega}} = \mathbf{N}$.
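
Equations (27.6) are readily integrated numerically. The sketch below is an illustration only: it solves them for the derivatives $d\omega_i/dt$ and advances a torque-free asymmetric top with a classical fourth-order Runge-Kutta step (a standard integration scheme chosen here for convenience, not a method taken from the text). The moments of inertia, the initial angular velocity, and the step size are assumed values. For $\mathbf{N} = 0$ the kinetic energy and the square of the angular momentum should stay constant, which gives a simple consistency check.

```python
def euler_rhs(w, I, N=(0.0, 0.0, 0.0)):
    """Solve Eqs. (27.6) for the derivatives d(omega_i)/dt."""
    I1, I2, I3 = I
    w1, w2, w3 = w
    return ((N[0] - (I3 - I2) * w2 * w3) / I1,
            (N[1] - (I1 - I3) * w3 * w1) / I2,
            (N[2] - (I2 - I1) * w1 * w2) / I3)

def rk4_step(w, I, dt):
    """One classical Runge-Kutta step for the torque-free equations."""
    k1 = euler_rhs(w, I)
    k2 = euler_rhs(tuple(w[i] + 0.5 * dt * k1[i] for i in range(3)), I)
    k3 = euler_rhs(tuple(w[i] + 0.5 * dt * k2[i] for i in range(3)), I)
    k4 = euler_rhs(tuple(w[i] + dt * k3[i] for i in range(3)), I)
    return tuple(w[i] + dt / 6.0 * (k1[i] + 2.0 * k2[i] + 2.0 * k3[i] + k4[i])
                 for i in range(3))

def kinetic_energy(w, I):
    return 0.5 * sum(I[i] * w[i] ** 2 for i in range(3))

def momentum_squared(w, I):
    return sum((I[i] * w[i]) ** 2 for i in range(3))

# Assumed principal moments and initial angular velocity (body axes)
I = (1.0, 2.0, 3.0)
w = (1.0, 0.1, 0.5)
T0, M2_0 = kinetic_energy(w, I), momentum_squared(w, I)

dt = 1e-3
for _ in range(5000):
    w = rk4_step(w, I, dt)

T1, M2_1 = kinetic_energy(w, I), momentum_squared(w, I)
print("change in T:  ", T1 - T0)
print("change in M^2:", M2_1 - M2_0)
```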


28. Free Symmetric Top 


Let us use Euler’s equations for studying the motion of a symmetric
top (i.e. a body for which $I_1 = I_2 \neq I_3$) not experiencing the action
of external forces. In this case, the centre of mass of the body moves
at a constant velocity or is at rest [see formula (26.3)]. Therefore,
from among all the inertial frames, we can choose the frame $K$
whose origin coincides with the centre of mass $C$ of the body. In
this frame, translational motion of the body is absent, and it remains
for us to establish only the nature of the rotation.

In the absence of external forces, the angular momentum $\mathbf{M}$ of
the body remains constant in magnitude and direction relative to
the stationary frame $K$. Let us choose the direction of the vector $\mathbf{M}$
as the $Z$-axis. We shall see that the vector $\mathbf{M}$, generally speaking,
constantly changes its direction relative to the frame $K'$ associated
with the body.

Since $I_1 = I_2$ and the moment of the external forces is zero, Eqs.
(27.6) have the following form¹:

$$\begin{aligned}
I_1\frac{d\omega_1}{dt} + (I_3 - I_1)\,\omega_2\omega_3 &= 0 \\
I_1\frac{d\omega_2}{dt} - (I_3 - I_1)\,\omega_3\omega_1 &= 0 \\
I_3\frac{d\omega_3}{dt} &= 0
\end{aligned} \tag{28.1}$$

We directly obtain from the third equation that $\omega_3 = \omega_\parallel = \text{const}$.
This signifies that the projection of the angular velocity vector onto
the $z$-axis associated with the body remains constant.

Introducing the symbol

$$\Omega = \frac{I_3 - I_1}{I_1}\,\omega_\parallel \tag{28.2}$$

we can write the first two of Eqs. (28.1) as follows:

$$\dot\omega_1 = -\Omega\omega_2, \qquad \dot\omega_2 = \Omega\omega_1$$
¹ We remind our reader that Euler’s equations are written in a coordinate
system whose axes coincide with the principal axes of inertia of the body.

It is a simple matter to see that the system we have obtained is
satisfied by the functions

$$\omega_1 = \omega_\perp\cos(\Omega t + \alpha), \qquad \omega_2 = \omega_\perp\sin(\Omega t + \alpha) \tag{28.3}$$

where $\omega_\perp$ and $\alpha$ are constants, $\omega_\perp = \sqrt{\omega_1^2 + \omega_2^2}$ being the magnitude
of the projection of the vector $\boldsymbol{\omega}$ onto the plane $xy$ perpendicular to
the axis of proper rotation of the body (the $z$-axis). Inspection of
(28.3) shows that the component $\boldsymbol{\omega}_\perp$ perpendicular to the $z$-axis,
remaining constant in magnitude, rotates uniformly in the plane $xy$
at the angular velocity $\Omega$ determined by formula (28.2). We have
seen that the component $\boldsymbol{\omega}_\parallel$ parallel to the $z$-axis also remains
constant in magnitude. We thus conclude that the vector $\boldsymbol{\omega}$ rotates
relative to the body at the angular velocity $\Omega$, describing a cone
about the $z$-axis, the magnitude of the vector $\boldsymbol{\omega}$ remaining
unchanged (Fig. 28.1¹).
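
That the functions (28.3) do satisfy Eqs. (28.1) can also be confirmed numerically. In the Python sketch below the derivatives are approximated by central finite differences and substituted into the left-hand sides of (28.1); the values of $I_1$, $I_3$, $\omega_\parallel$, $\omega_\perp$, and $\alpha$ are assumed for illustration only.

```python
import math

# Assumed illustrative parameters of a symmetric top
I1, I3 = 1.0, 2.0
w_par = 3.0                       # omega_parallel = omega_3 = const
w_perp, alpha = 0.5, 0.2          # constants of the solution (28.3)
Omega = (I3 - I1) / I1 * w_par    # Eq. (28.2)

def omega(t):
    """Components of the angular velocity along the body axes, Eq. (28.3)."""
    return (w_perp * math.cos(Omega * t + alpha),
            w_perp * math.sin(Omega * t + alpha),
            w_par)

def residuals(t, h=1e-5):
    """Left-hand sides of Eqs. (28.1), derivatives taken by central differences."""
    w_plus, w_minus, w = omega(t + h), omega(t - h), omega(t)
    dw = [(w_plus[i] - w_minus[i]) / (2.0 * h) for i in range(3)]
    return (I1 * dw[0] + (I3 - I1) * w[1] * w[2],
            I1 * dw[1] - (I3 - I1) * w[2] * w[0],
            I3 * dw[2])

sample_times = (0.0, 0.37, 1.0, 2.5)
worst_residual = max(abs(res) for t in sample_times for res in residuals(t))

# The magnitude of omega stays constant while its direction precesses about z
magnitudes = [math.sqrt(sum(c * c for c in omega(t))) for t in sample_times]

print("largest residual of (28.1):", worst_residual)
print("|omega| at sample times:   ", magnitudes)
```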

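According to (24.6), the projections of the vector $\mathbf{M}$ onto the
axes $x, y, z$ are

$$M_x = I_1\omega_1, \qquad M_y = I_1\omega_2, \qquad M_z = I_3\omega_3$$

Hence,

$$\mathbf{M} = I_1\omega_1\mathbf{e}_x + I_1\omega_2\mathbf{e}_y + I_3\omega_3\mathbf{e}_z = I_1(\omega_1\mathbf{e}_x + \omega_2\mathbf{e}_y) + I_3\omega_3\mathbf{e}_z$$

where $\mathbf{e}_x, \mathbf{e}_y, \mathbf{e}_z$ are the unit vectors of the relevant axes (these
axes rotate together with the body). The sum $\omega_1\mathbf{e}_x + \omega_2\mathbf{e}_y$ gives
the component $\boldsymbol{\omega}_\perp$ perpendicular to the $z$-axis; $\omega_3\mathbf{e}_z$ is the
component $\boldsymbol{\omega}_\parallel$ parallel to the $z$-axis. Hence,

$$\mathbf{M} = I_1\boldsymbol{\omega}_\perp + I_3\boldsymbol{\omega}_\parallel \tag{28.4}$$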

The directions of the vectors $\mathbf{M}$ and $\boldsymbol{\omega}_\parallel$ pass through a common
origin for the frames $K$ and $K'$, i.e. the point $C$. Consequently,
these vectors determine a certain plane (the plane $Zz$). For equality
(28.4) to be observed, the vector $\boldsymbol{\omega}_\perp$ must be in the same plane.
Hence, the vector $\boldsymbol{\omega} = \boldsymbol{\omega}_\perp + \boldsymbol{\omega}_\parallel$ is in the plane $Zz$. We thus conclude
that the vectors $\mathbf{M}$, $\boldsymbol{\omega}$, and the axis $z$ of proper rotation of the body
are in the same plane at each instant (this plane is hatched in
Fig. 28.1). The plane rotates about the direction of $\mathbf{M}$. The $z$-axis
rotates together with it and describes about the $Z$-axis a circular
cone. Such rotation of the body’s $z$-axis is called regular precession.
The latter is characterized by a constant nutation angle $\vartheta$.

We have established that the vector $\boldsymbol{\omega}$ rotates relative to the
body about the $z$-axis at the angular velocity $\Omega$. At the same time,
this vector remains in the plane $Zz$. It thus follows that relative to
the plane $Zz$, the body rotates about the $z$-axis in the opposite
direction at the same velocity $\Omega$.

¹ In practice, a body of revolution is taken as a symmetric top. We have
depicted a body of an irregular shape in the figure, however, to underline the fact
that the only condition for a body to be a symmetric top is the equality of two
of its principal moments of inertia.

Figure 28.2 shows the components $I_1\boldsymbol{\omega}_\perp$ and $I_3\boldsymbol{\omega}_\parallel$ whose sum
gives the angular momentum $\mathbf{M}$ [see (28.4)]. A glance at the figure
shows that

$$\tan\vartheta = \frac{I_1\omega_\perp}{I_3\omega_\parallel} \tag{28.5}$$

We established earlier that $\omega_\perp$ and $\omega_\parallel$ are constants. Therefore,
$\tan\vartheta$ and, consequently, the nutation angle $\vartheta$ itself remain constant.
The angle between the vector $\boldsymbol{\omega}$ and the $z$-axis is also constant
(its tangent is $\omega_\perp/\omega_\parallel$).

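The quantities $\omega_\perp$ and $\omega_\parallel$ are determined by the body’s kinetic
energy $T$ and the angular momentum $M$. Indeed,

$$\begin{aligned}
T &= \frac{1}{2}(I_1\omega_1^2 + I_1\omega_2^2 + I_3\omega_3^2) = \frac{1}{2}(I_1\omega_\perp^2 + I_3\omega_\parallel^2) \\
M^2 &= (I_1\omega_1)^2 + (I_1\omega_2)^2 + (I_3\omega_3)^2 = I_1^2\omega_\perp^2 + I_3^2\omega_\parallel^2
\end{aligned} \tag{28.6}$$

Solving this system of equations for $\omega_\perp$ and $\omega_\parallel$, we find the
expressions of these quantities in terms of $T$ and $M$:

$$\omega_\parallel = \sqrt{\frac{M^2 - 2TI_1}{I_3(I_3 - I_1)}}, \qquad \omega_\perp = \sqrt{\frac{M^2 - 2TI_3}{I_1(I_1 - I_3)}} \tag{28.7}$$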

Assume that the body is flattened along the $z$-axis. Hence, $I_3 > I_1$,
and the denominator in the radicand of the expression for $\omega_\parallel$ will
be positive and for $\omega_\perp$, negative. Consequently, for $\omega_\parallel$ and $\omega_\perp$ to be
real, it is necessary that the conditions $M^2 - 2TI_1 \geq 0$ and
$M^2 - 2TI_3 \leq 0$ be satisfied; they can be combined into a single
formula:

$$\frac{M^2}{2I_3} \leq T \leq \frac{M^2}{2I_1} \tag{28.8}$$

The energy of a free symmetric top cannot have values beyond the
indicated limits (at a given $M$). If the energy has its lowest possible
value, i.e. at $M^2 = 2TI_3$, the quantity $\omega_\perp$ vanishes [see (28.7)].
Examination of formula (28.5) shows that in this case $\vartheta = 0$: the
axes $Z$ and $z$ coincide, the directions of the vectors $\mathbf{M}$ and $\boldsymbol{\omega}$ also
coincide, and the vector $\boldsymbol{\omega}$ does not move relative to the body or
relative to the frame $K$. We must note that in this case the following
relation holds:

$$T = \frac{M^2}{2I_3} \tag{28.9}$$

Assume that the energy has its highest possible value, i.e.
$M^2 = 2TI_1$. Now $\omega_\parallel$ vanishes [see (28.7)]. By formula (28.5), we have
$\vartheta = \pi/2$ in this case: the axes $Z$ and $z$ are mutually perpendicular,
there is no rotation about the $z$-axis, the vector $\boldsymbol{\omega} = \boldsymbol{\omega}_\perp$ coincides
in direction with the vector $\mathbf{M}$, and the energy is related to the angular
momentum by the expression

$$T = \frac{M^2}{2I_1} \tag{28.10}$$
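
Formulas (28.5), (28.7), and (28.8) can be combined into a short computational sketch. The Python function below (a hypothetical helper with assumed numerical values) recovers $\omega_\parallel$ and $\omega_\perp$ from $T$ and $M$, rejects energies outside the limits (28.8), and evaluates the nutation angle.

```python
import math

def components_from_T_and_M(T, M, I1, I3):
    """Return (omega_parallel, omega_perp) from Eqs. (28.7).

    Raises ValueError when T lies outside the limits (28.8) for the given M.
    """
    if I1 == I3:
        raise ValueError("Eqs. (28.7) require I1 != I3")
    par2 = (M * M - 2.0 * T * I1) / (I3 * (I3 - I1))
    perp2 = (M * M - 2.0 * T * I3) / (I1 * (I1 - I3))
    if par2 < 0.0 or perp2 < 0.0:
        raise ValueError("T is outside the limits (28.8) for the given M")
    return math.sqrt(par2), math.sqrt(perp2)

# Assumed top, flattened along z (I3 > I1), with known angular velocity components
I1, I3 = 1.0, 2.0
w_par_true, w_perp_true = 3.0, 0.5

# Eqs. (28.6)
T = 0.5 * (I1 * w_perp_true ** 2 + I3 * w_par_true ** 2)
M = math.sqrt((I1 * w_perp_true) ** 2 + (I3 * w_par_true) ** 2)

w_par, w_perp = components_from_T_and_M(T, M, I1, I3)

# Nutation angle from Eq. (28.5)
theta = math.atan2(I1 * w_perp, I3 * w_par)

# Limits (28.8) on the energy at the given M
T_min, T_max = M * M / (2.0 * I3), M * M / (2.0 * I1)

print(w_par, w_perp, theta, (T_min, T, T_max))
```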


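The relations we have obtained can be given a fine geometric
interpretation. Let us rewrite formulas (28.6) as follows:

$$\frac{\omega_1^2}{2T/I_1} + \frac{\omega_2^2}{2T/I_1} + \frac{\omega_3^2}{2T/I_3} = 1 \tag{28.11}$$

$$\frac{\omega_1^2}{M^2/I_1^2} + \frac{\omega_2^2}{M^2/I_1^2} + \frac{\omega_3^2}{M^2/I_3^2} = 1 \tag{28.12}$$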

Each of these equations describes an ellipsoid of revolution. If,
as we have assumed, $I_3 > I_1$, both ellipsoids are flattened along the
axis $\omega_3$, which coincides with the $z$-axis (an ellipsoid of inertia in
this case, conversely, is extended along the $z$-axis). It is a simple
matter to comprehend¹ that the first ellipsoid (let us call it an
energy ellipsoid) is flattened less than the second one (which we shall
call the angular momentum ellipsoid). Figure 28.3 shows both
ellipsoids. The values of $\omega_1, \omega_2, \omega_3$ satisfying Eqs. (28.11) and
(28.12) are determined by the lines of intersection of both ellipsoids.
It thus follows that the tip of the vector $\boldsymbol{\omega}$ must slide along this
line of intersection. Consequently, the vector $\boldsymbol{\omega}$ rotates relative to
the axes $\omega_1, \omega_2, \omega_3$, describing a cone about the axis $\omega_3$. It must be
remembered that $\omega_i$ is the projection of $\boldsymbol{\omega}$ onto the $i$-th principal
axis of inertia of the body. Hence, the axes $\omega_i$ coincide with the
axes $x, y, z$. Rotation relative to the axes $\omega_i$ thus signifies rotation
relative to the body itself.

Fig. 28.3.

¹ For this purpose, we must take into account that $I_3^2/I_1^2 > I_3/I_1$.

We can increase M with T remaining constant until the energy and
angular momentum ellipsoids have only two common points. This
occurs provided that $\sqrt{2T/I_3} = M/I_3$ [compare with (28.9)]. In
this case, the vector $\boldsymbol{\omega}$ coincides in direction with the z-axis. If
we continue to increase M, i.e. assume that $M^2/2I_3 > T$, the
ellipsoids stop touching each other, and the system of equations (28.11)
and (28.12) has no common solutions. Such a case cannot be realized.
Consequently, we have obtained the lower boundary for T [see
(28.8)].

By diminishing M with T remaining constant, we arrive at a
situation when both lines of intersection of the ellipsoids merge
into a single line of contact in the equatorial plane. This occurs
provided that $\sqrt{2T/I_1} = M/I_1$ [compare with (28.10)]. In this case,
the vector $\boldsymbol{\omega}$ rotates about the z-axis, constantly remaining
perpendicular to it. If we continue to decrease M, i.e. assume that
$T > M^2/2I_1$, the ellipsoids stop touching each other, and the system
of equations (28.11) and (28.12) has no common solutions. We have
thus arrived at the upper boundary for T [see (28.8)].
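The two boundaries for T can be checked numerically. The sketch below is only an illustration: it treats Eqs. (28.11) and (28.12) as two linear equations in $\omega_1^2 + \omega_2^2$ and $\omega_3^2$ and reports whether a real solution exists. The function name and the sample values of $I_1$, $I_3$ and M are assumptions made for the example, not values taken from the text.

```python
def omega_squares(T, M, I1, I3):
    """Solve Eqs. (28.11)-(28.12) for (omega_1^2 + omega_2^2, omega_3^2).

    Returns None when the energy and angular momentum ellipsoids have no
    common point. Assumes I3 > I1, as in the text.
    """
    # Cleared of denominators, the two equations read
    #   I1   * w_perp2 + I3   * w3_2 = 2T
    #   I1^2 * w_perp2 + I3^2 * w3_2 = M^2
    w_perp2 = (2.0 * T * I3 - M * M) / (I1 * (I3 - I1))
    w3_2 = (M * M - 2.0 * T * I1) / (I3 * (I3 - I1))
    if w_perp2 < 0.0 or w3_2 < 0.0:
        return None  # a negative square: the ellipsoids do not intersect
    return w_perp2, w3_2


if __name__ == "__main__":
    I1, I3, M = 1.0, 2.0, 2.0  # illustrative values only
    # Here M^2/(2*I3) = 1 and M^2/(2*I1) = 2 bound the admissible energies.
    for T in (0.9, 1.0, 1.5, 2.0, 2.1):
        print(T, omega_squares(T, M, I1, I3))
```

At the lower boundary the perpendicular part of the angular velocity vanishes, and at the upper boundary $\omega_3$ vanishes, in line with the two limiting cases discussed above.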

29. Symmetric Top in a Homogeneous Gravitational Field 

Consider the behaviour of a symmetric top with one fixed point 
in a homogeneous gravitational field. We must note that a general 
solution¹ of the problem on the motion of a body with one fixed
point in a homogeneous gravitational field can be obtained only in 
three cases: 

(1) for an asymmetric balanced top (a top is called balanced if the 
fixed point coincides with the centre of mass of the top). This case 
is known as Euler’s problem; 

(2) for a symmetric unbalanced top (the fixed point does not
coincide with the centre of mass) in which the centre of mass is on the
z-axis. This is Lagrange’s problem;

(3) for a symmetric unbalanced top for which $I_1 = I_2 = 2I_3$, and
the centre of mass is in the plane xy. This is the problem of
S. Kovalevskaya.

We shall consider Lagrange’s problem. The equations of motion
in this case are integrated in a very complicated way. We shall
therefore limit ourselves to writing the initial equations and
discussing their solutions.

We place the origins of both coordinate systems K and K' at the
fixed point A of the top (at the point of support). We direct the
Z-axis of the stationary frame K along a vertical line, and the z-axis
of the frame K' associated with the top along its third principal
axis of inertia ($I_1 = I_2 \neq I_3$). With such a choice of the coordinate
axes, the top’s potential energy has the form $U = mgl\cos\vartheta$, where
l is the distance from the supporting point to the centre of mass C
(we assume that the coordinate z of the centre of mass, i.e. $z_C$, is
greater than zero).

¹ That is, the solution obtained with the aid of quadratures with
arbitrary initial conditions.

Let us find the expression for the kinetic energy in the given case.
With a view to the point A being fixed, we can write

$$T = \frac{1}{2} \sum_\alpha m_\alpha \left[\boldsymbol{\omega} \times \mathbf{r}_\alpha^{(A)}\right]^2$$

and perform the same transformations that we made for expression
(23.3). As a result, we arrive at a formula which will differ from
(23.5) only in containing the coordinates $x_{i\alpha}^{(A)}$, $x_{k\alpha}^{(A)}$, etc. instead of
the coordinates $x_{i\alpha}$, $x_{k\alpha}$, etc. Consequently, we obtain an expression
similar to (23.8) for the kinetic energy:

$$T = \frac{1}{2} \sum_{i,k} I_{ik}^{(A)} \omega_i \omega_k$$

where $I_{ik}^{(A)}$ are the tensor components determined by formula (23.16).
According to (23.18), these components are

$$I_{ik}^{(A)} = I_{ik} + m (a^2 \delta_{ik} - a_i a_k)$$

In our case, $a_1 = a_2 = 0$, and $a_3 = -a = -l$ [here $a_i$ is the i-th
coordinate of the point A in the frame with its origin at the centre
of mass C, see (23.17)]. In addition,
since the z-axis coincides with the third principal axis of inertia,
while the axes x and y are parallel to the two other principal axes¹, we
have $I_{ik} = I_i \delta_{ik}$. We can thus conclude that the tensor $I_{ik}^{(A)}$ is
diagonal, its non-zero components being

$$I_1^{(A)} = I_1 + ml^2, \quad I_2^{(A)} = I_2 + ml^2, \quad I_3^{(A)} = I_3$$

Hence, taking into account that $I_1 = I_2$, we obtain the following
expression for the kinetic energy:

$$T = \frac{1}{2} \left[ (I_1 + ml^2)(\omega_1^2 + \omega_2^2) + I_3 \omega_3^2 \right]$$

Introducing values (22.4) for the components of $\boldsymbol{\omega}$ into this
expression, we obtain

$$T = \frac{1}{2} \left[ (I_1 + ml^2)(\dot\varphi^2 \sin^2\vartheta + \dot\vartheta^2) + I_3 (\dot\varphi \cos\vartheta + \dot\psi)^2 \right]$$


¹ In a symmetric top, any two mutually perpendicular axes perpendicular
to the axis of symmetry may be principal axes of inertia.

Let us write the Lagrangian¹

$$L = \frac{1}{2}(I_1 + ml^2)(\dot\varphi^2 \sin^2\vartheta + \dot\vartheta^2) + \frac{1}{2} I_3 (\dot\varphi \cos\vartheta + \dot\psi)^2 - mgl\cos\vartheta \tag{29.1}$$

The coordinates $\varphi$ and $\psi$ are cyclic [see the text in Sec. 11 related
to formula (11.2)]. Therefore, the generalized momenta $p_\varphi$ and
$p_\psi$ are integrals of motion. The energy E is a third integral of motion.
We thus have three equations:

$$p_\varphi = \partial L/\partial\dot\varphi = [(I_1 + ml^2)\sin^2\vartheta + I_3\cos^2\vartheta]\,\dot\varphi + I_3\cos\vartheta\,\dot\psi = M_Z = \text{const}$$

$$p_\psi = \partial L/\partial\dot\psi = I_3 (\dot\varphi \cos\vartheta + \dot\psi) = M_z = \text{const}$$ ²

$$E = T + U = \text{const}$$

An analysis of the solutions of these equations leads to the following
results. The angle $\vartheta$ varies periodically within the limits from
$\vartheta_1$ to $\vartheta_2$ determined by the initial conditions (particularly, by the
relation between the energy and the angular momentum of the top).
The oscillations of the top’s axis corresponding to the variations of
the angle $\vartheta$ are called nutation. Simultaneously, the axis of the top
precesses, i.e. turns about the Z-axis. As a result, the apex, i.e. the
point of intersection of the z-axis (the axis of the top) and a sphere
of unit radius, describes one of the curves shown in Fig. 29.1. The
sign of the derivative $\dot\varphi$ either remains unchanged (Fig. 29.1a and b)
or changes (Fig. 29.1c). Case (b) occurs when $\dot\varphi$ and $\dot\vartheta$ simultaneously
vanish.


¹ The force applied to a top at its point of support is a reaction of the
constraint, which, as we know, is not included in Lagrange’s equations.

² Compare with (26.9).

The nature of the behaviour of $\dot\varphi$, like the values of $\vartheta_1$ and $\vartheta_2$,
depends on the initial conditions. Motion of the top’s axis such as
that in case (b) corresponds to natural initial conditions when the
top is first brought into rotation about its axis, after which the axis
is released and begins its motion. At the moment of axis release,
both $\dot\varphi$ and $\dot\vartheta$ equal zero. In these initial conditions, the top first
inclines, and then upon reaching the boundary angle $\vartheta_2$ it begins to
rise (see Fig. 29.1b).

Under quite specific initial conditions, both boundary values $\vartheta_1$
and $\vartheta_2$ coincide so that the top’s axis precesses without nutation.
As we have already noted, such precession is called regular. To
obtain regular precession, a top must be given an initial impetus of
a quite definite magnitude and direction.
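The nutation limits for case (b) can be illustrated numerically. The following is a minimal sketch under stated assumptions, not the integration by quadratures that the text alludes to: it eliminates $\dot\varphi$ with the help of the integral $p_\varphi = M_Z$, writes Lagrange's equation for $\vartheta$ obtained from (29.1), and advances it with a classical fourth-order Runge-Kutta step (a numerical technique chosen here for convenience). The function name, the parameter `I1p` (standing for $I_1 + ml^2$) and all numerical values are hypothetical.

```python
import math


def nutation_limits(theta0, M_z, I1p=1.0, mgl=1.0, dt=1e-3, t_end=5.0):
    """Return (theta_min, theta_max) for a top released at theta0 with
    phi_dot = theta_dot = 0, i.e. case (b).

    I1p stands for I_1 + m*l^2; M_z is the conserved momentum p_psi.
    """
    # phi_dot = 0 at release, so p_phi = M_Z reduces to M_z * cos(theta0).
    M_Z = M_z * math.cos(theta0)

    def phi_dot(theta):
        # From p_phi = I1p * sin^2(theta) * phi_dot + M_z * cos(theta) = M_Z.
        return (M_Z - M_z * math.cos(theta)) / (I1p * math.sin(theta) ** 2)

    def theta_ddot(theta):
        # Lagrange's equation for theta derived from (29.1).
        pd = phi_dot(theta)
        s, c = math.sin(theta), math.cos(theta)
        return (I1p * pd * pd * s * c - M_z * pd * s + mgl * s) / I1p

    theta, theta_dot = theta0, 0.0
    lo = hi = theta0
    for _ in range(int(round(t_end / dt))):
        # One classical Runge-Kutta step for the pair (theta, theta_dot).
        k1x, k1v = theta_dot, theta_ddot(theta)
        k2x, k2v = theta_dot + 0.5 * dt * k1v, theta_ddot(theta + 0.5 * dt * k1x)
        k3x, k3v = theta_dot + 0.5 * dt * k2v, theta_ddot(theta + 0.5 * dt * k2x)
        k4x, k4v = theta_dot + dt * k3v, theta_ddot(theta + dt * k3x)
        theta += dt * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
        theta_dot += dt * (k1v + 2 * k2v + 2 * k3v + k4v) / 6.0
        lo, hi = min(lo, theta), max(hi, theta)
    return lo, hi


if __name__ == "__main__":
    for M_z in (5.0, 10.0):
        print(M_z, nutation_limits(1.0, M_z))
```

With these illustrative numbers the angle never drops below its release value: the axis first inclines and then rises again, as described for case (b). Doubling `M_z` narrows the interval between the two limits.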

For a “rapid” top (i.e. a top whose kinetic energy of proper rotation
is high compared with its energy in a gravitational field), the action
of gravitational forces may be disregarded in a first approximation.
Consequently, the motion of the top can be represented as the free
precession of the top’s axis about the direction of the angular
momentum $\mathbf{M}$, considered in Sec. 28 (this precession corresponds to nutation
of a heavy top), onto which small disturbances due to the action of
the gravitational force are superposed. These disturbances cause
slow precession of the angular momentum $\mathbf{M}$ about a vertical line.

Calculations show that the more rapidly a top rotates, the smaller
is the amplitude of nutation. In addition, in a real rapid top, nutation
is damped by friction at the support. Therefore, in practice the
nutation of a sufficiently rapid top is sometimes unnoticeable, and
the top seems to uniformly precess about a vertical axis. Since such
precession is regular only approximately, it is known as
pseudoregular precession.

30. Hamilton's Equations

In solving problems on the motion of a system with s degrees of
freedom using Lagrange’s equations, we have to solve a system of
s second-order differential equations. The generalized coordinates $q_k$
and the generalized velocities $\dot q_k$ are the independent variables in
these equations.

W. Hamilton obtained equations of motion in which the generalized
coordinates $q_k$ and the generalized momenta $p_k$ are the independent
variables. Hamilton’s equations or, as they are also called,
canonical¹ equations ($q_k$ and $p_k$ are accordingly called canonical
variables), unlike Lagrange’s equations, are first-order differential
ones. But on the other hand, their number needed to describe a
system with s degrees of freedom is 2s.

Hamilton’s equations can be derived either from Lagrange’s
equations, or directly from the principle of least action (we shall
give both derivations below). It is natural that they give nothing
novel in essence. But canonical equations are more symmetric than
Lagrange’s equations and, in addition, being invariant with respect
to canonical transformations, they open up great possibilities for
generalizations that play an important role in electrodynamics,
statistical physics, and quantum mechanics.

Hamilton took the energy (5.1) expressed in terms of the variables
$q_k$ and $p_k$ as the function characterizing a mechanical system.
Having in view that

$$p_k = \frac{\partial L}{\partial \dot q_k} \tag{30.1}$$

[see (4.19)], let us write this function as follows:

$$H(q_k, p_k, t) = \sum_k p_k \dot q_k - L(q_k, \dot q_k, t) \tag{30.2}$$

($\dot q_k$ is assumed to be expressed in terms of $q_k$ and $p_k$). The
characteristic function H is called the Hamiltonian function or simply the
Hamiltonian.

¹ Hamilton’s equations are called canonical because they remain invariant
upon quite general transformations of the variables. With the aid of such
canonical transformations, we can pass over from the variables $q_k$ and $p_k$
to other canonical variables: $Q_i(q_k, p_k, t)$ and $P_i(q_k, p_k, t)$. Here
Hamilton’s equations retain their form, though with a new Hamiltonian
$H'(Q_i, P_i, t)$ that replaces the function $H(q_k, p_k, t)$. The variables
$Q_i$ and $P_i$ may have a different physical meaning than the variables
$q_k$ and $p_k$.

We shall give as an example the Hamiltonian of a particle moving
in the potential field $U = U(x, y, z, t)$:

$$H = \frac{1}{2m}(p_x^2 + p_y^2 + p_z^2) + U(x, y, z, t) \tag{30.3}$$

For a particle moving in a stationary field, H has the same form, but
U does not depend explicitly on t.

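Let us derive Hamilton’s equations proceeding from those of
Lagrange. For this purpose, we shall find the total differentials of
the left-hand and right-hand sides of formula (30.2) and equate
them to each other. The total differential of the left-hand side is

$$dH = \sum_k \frac{\partial H}{\partial q_k}\,dq_k + \sum_k \frac{\partial H}{\partial p_k}\,dp_k + \frac{\partial H}{\partial t}\,dt \tag{30.4}$$

The differential of the right-hand side is

$$dH = \sum_k p_k\,d\dot q_k + \sum_k \dot q_k\,dp_k - \sum_k \frac{\partial L}{\partial q_k}\,dq_k - \sum_k \frac{\partial L}{\partial \dot q_k}\,d\dot q_k - \frac{\partial L}{\partial t}\,dt \tag{30.5}$$

In view of relation (30.1), the first and fourth sums cancel each other.
It follows from Lagrange’s equation (4.16)¹ that

$$\frac{\partial L}{\partial q_k} = \frac{d}{dt}\frac{\partial L}{\partial \dot q_k} = \dot p_k \tag{30.6}$$

Let us substitute $\dot p_k$ for $\partial L/\partial q_k$ in the third sum of formula (30.5).
As a result, expression (30.5) becomes

$$dH = \sum_k \dot q_k\,dp_k - \sum_k \dot p_k\,dq_k - \frac{\partial L}{\partial t}\,dt \tag{30.7}$$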

For expressions (30.4) and (30.7) to be equal at arbitrary values
of $dq_k$, $dp_k$, and $dt$, the following conditions must be observed:

$$\dot q_k = \frac{\partial H}{\partial p_k}, \quad \dot p_k = -\frac{\partial H}{\partial q_k} \quad (k = 1, 2, \ldots, s) \tag{30.8}$$

$$\frac{\partial H}{\partial t} = -\frac{\partial L}{\partial t} \tag{30.9}$$

Equations (30.8) are the required Hamilton or canonical equations.
As we have already noted, they are first-order differential equations.
Their total number is 2s.

¹ We assume that all the forces acting in the system are potential ones;
this is why we use Eq. (4.16) instead of (4.15).

For a particle described by the Hamiltonian (30.3), Hamilton’s
equations have the form

$$\dot x = \frac{p_x}{m}, \quad \dot p_x = -\frac{\partial U}{\partial x}, \quad \text{etc.}$$
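Because Hamilton's equations are of the first order, they lend themselves to step-by-step numerical integration. The sketch below is an illustration under assumptions: it takes a one-dimensional particle with the hypothetical potential $U = kx^2/2$ (not discussed in the text) and advances $\dot x = p/m$, $\dot p = -\partial U/\partial x$ with the leapfrog (Störmer-Verlet) scheme, which is a choice made here for its good energy behaviour. All names and values are illustrative.

```python
import math


def hamiltonian(x, p, m=1.0, k=1.0):
    """H = p^2/(2m) + U(x) with the assumed potential U = k*x^2/2."""
    return p * p / (2.0 * m) + 0.5 * k * x * x


def integrate(x, p, m=1.0, k=1.0, dt=1e-3, steps=10000):
    """Advance Hamilton's equations x_dot = p/m, p_dot = -dU/dx by leapfrog."""
    for _ in range(steps):
        p -= 0.5 * dt * k * x   # half step of p_dot = -dU/dx = -k*x
        x += dt * p / m         # full step of x_dot = dH/dp = p/m
        p -= 0.5 * dt * k * x   # second half step of p_dot
    return x, p


if __name__ == "__main__":
    x0, p0 = 1.0, 0.0
    x1, p1 = integrate(x0, p0)          # total time steps*dt = 10
    print(x1, math.cos(10.0))           # exact solution is x = cos(t)
    print(hamiltonian(x1, p1) - hamiltonian(x0, p0))
```

Since this H does not depend explicitly on the time, its numerical value should stay close to the initial one along the computed trajectory.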


Now let us derive Hamilton’s equations from the principle of
least action. We remind our reader that according to this principle,
a system moves so that the action S [see (7.1)] has the smallest
possible value. This statement is written in the form

$$\delta S = \delta \int_{t_1}^{t_2} L(q_k, \dot q_k, t)\,dt = 0$$

Let us introduce into this equation the value of L obtained from
relation (30.2):

$$\delta \int_{t_1}^{t_2} \Big( \sum_k p_k \dot q_k - H \Big)\,dt = 0 \tag{30.10}$$

The variation on the left-hand side of (30.10) can be written as

$$\delta S = \int_{t_1}^{t_2} \sum_k \Big( p_k\,\delta\dot q_k + \dot q_k\,\delta p_k - \frac{\partial H}{\partial q_k}\,\delta q_k - \frac{\partial H}{\partial p_k}\,\delta p_k \Big)\,dt$$

Let us integrate the first term by parts:

$$\int_{t_1}^{t_2} p_k\,\delta\dot q_k\,dt = p_k\,\delta q_k \Big|_{t_1}^{t_2} - \int_{t_1}^{t_2} \dot p_k\,\delta q_k\,dt$$

Here we have taken advantage of the fact that $\delta\dot q_k = \frac{d}{dt}(\delta q_k)$
[see (III.4)]. The variations $\delta q_k$ vanish when the integration limits
are introduced¹. Therefore, the first term of the expression obtained
must be discarded. Consequently, condition (30.10) will be written
as follows:

$$\int_{t_1}^{t_2} \sum_k \Big[ \Big( \dot q_k - \frac{\partial H}{\partial p_k} \Big)\,\delta p_k - \Big( \dot p_k + \frac{\partial H}{\partial q_k} \Big)\,\delta q_k \Big]\,dt = 0$$

Owing to the arbitrary nature of the variations $\delta q_k$ and $\delta p_k$, this
condition can be satisfied only if the expressions in parentheses are
zero. Hence we directly obtain Eqs. (30.8).

Let us investigate the Hamiltonian H. We find the total derivative
of this function with respect to time:

$$\frac{dH}{dt} = \frac{\partial H}{\partial t} + \sum_k \frac{\partial H}{\partial q_k}\,\dot q_k + \sum_k \frac{\partial H}{\partial p_k}\,\dot p_k$$

Taking into account values (30.8) for $\dot q_k$ and $\dot p_k$, we find that

$$\frac{dH}{dt} = \frac{\partial H}{\partial t} \tag{30.11}$$

Thus, if the function H does not depend explicitly on the time, it
retains its value. This was to be expected because H is the total
energy of a system, which is conserved provided that
$\partial L/\partial t = 0$ [see (30.9)].


Let us replace $\dot p_k$ in the second of equations (30.8) in accordance
with (30.6). As a result, we find that

$$\frac{\partial H}{\partial q_k} = -\frac{\partial L}{\partial q_k} \tag{30.12}$$

It thus follows that the generalized coordinates which are cyclic,
i.e. which are not contained explicitly in the function L, are also
not contained explicitly in the function H. On an earlier page [see
(11.2)], we established that the generalized momenta corresponding
to cyclic coordinates are integrals of motion. We can conclude from
what has been said that the generalized momenta corresponding
to the coordinates $q_k$ not included explicitly in the Hamiltonian
(i.e. cyclic relative to the function H) remain constant:

$$p_k = \text{const} \quad \text{provided that} \quad \frac{\partial H}{\partial q_k} = 0 \tag{30.13}$$


¹ We remind our reader that in the variation of trajectories in a
configuration space (see Sec. 4), the initial and final points of the
trajectories are assumed to be fixed.

31. Poisson Brackets

Let us take a function of the canonical variables $q_k$ and $p_k$, and
also of the time t, that is, $f(q_k, p_k, t)$, and establish in what
conditions this function will be an integral of motion¹. To do this, we
calculate the total derivative of this function with respect to time:

$$\frac{df}{dt} = \frac{\partial f}{\partial t} + \sum_k \Big( \frac{\partial f}{\partial q_k}\,\dot q_k + \frac{\partial f}{\partial p_k}\,\dot p_k \Big)$$

We substitute the values for the derivatives $\dot q_k$ and $\dot p_k$ from (30.8):

$$\frac{df}{dt} = \frac{\partial f}{\partial t} + \sum_k \Big( \frac{\partial f}{\partial q_k}\frac{\partial H}{\partial p_k} - \frac{\partial f}{\partial p_k}\frac{\partial H}{\partial q_k} \Big) \tag{31.1}$$

If we have two functions $\varphi(q_k, p_k, t)$ and $\psi(q_k, p_k, t)$, the
expression

$$\{\varphi, \psi\} = \sum_k \Big( \frac{\partial \varphi}{\partial q_k}\frac{\partial \psi}{\partial p_k} - \frac{\partial \varphi}{\partial p_k}\frac{\partial \psi}{\partial q_k} \Big) \tag{31.2}$$

is called the Poisson bracket for the functions $\varphi$ and $\psi$². When
necessary, the independent variables with respect to which the partial
derivatives are taken are indicated as subscripts of the symbol of
the Poisson bracket, i.e., for instance, bracket (31.2) is written as
$\{\varphi, \psi\}_{q,p}$. It is easy to see that

$$\{\varphi, \psi\}_{q,p} = -\{\varphi, \psi\}_{p,q} = \{\psi, \varphi\}_{p,q} \tag{31.3}$$

Using the Poisson bracket, expression (31.1) can be represented as

$$\frac{df}{dt} = \frac{\partial f}{\partial t} + \{f, H\}_{q,p} \tag{31.4}$$

or as

$$\frac{df}{dt} = \frac{\partial f}{\partial t} + \{H, f\}_{p,q} \tag{31.5}$$

A glance at (31.4) shows that the condition in which the function f
is an integral of motion is

$$\frac{\partial f}{\partial t} + \{f, H\} = 0 \tag{31.6}$$

We thus conclude that when the integral of motion f does not depend
explicitly on the time, its Poisson bracket with the Hamiltonian
equals zero.

¹ Do not forget that the functions of dynamic variables ($q_k$ and $\dot q_k$, or
$q_k$ and $p_k$) that remain constant in the motion of a system are known as
integrals of motion.

² Sometimes square brackets and even round ones (parentheses) are used to
designate the Poisson bracket instead of curly brackets (braces).

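Below are given some obvious properties of the Poisson bracket:

$$\{\varphi, \psi\} = -\{\psi, \varphi\} \tag{31.7}$$

$$\{\varphi, \varphi\} = 0 \tag{31.8}$$

$$\{(\varphi_1 + \varphi_2), \psi\} = \{\varphi_1, \psi\} + \{\varphi_2, \psi\} \tag{31.9}$$

$$\{(\varphi_1 \varphi_2), \psi\} = \varphi_1 \{\varphi_2, \psi\} + \varphi_2 \{\varphi_1, \psi\} \tag{31.10}$$

$$\frac{\partial}{\partial t}\{\varphi, \psi\} = \Big\{ \frac{\partial \varphi}{\partial t}, \psi \Big\} + \Big\{ \varphi, \frac{\partial \psi}{\partial t} \Big\} \tag{31.11}$$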

Particularly, we may take canonical variables as $\varphi$ or $\psi$, or as both
of them. This yields the following relations:

$$\{q_i, \psi\} = \frac{\partial \psi}{\partial p_i} \tag{31.12}$$

$$\{p_i, \psi\} = -\frac{\partial \psi}{\partial q_i} \tag{31.13}$$

$$\{q_i, q_k\} = 0 \tag{31.14}$$

$$\{p_i, p_k\} = 0 \tag{31.15}$$

$$\{q_i, p_k\} = \delta_{ik} \tag{31.16}$$

We invite our reader to obtain formulas (31.12)-(31.16) as an exercise.
In deriving them, take into account that $\partial p_i/\partial q_k = 0$ and
$\partial q_i/\partial p_k = 0$ (because $q_k$ and $p_k$ are independent variables).
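The bracket (31.2) and relations such as (31.16) are easy to probe numerically. The sketch below is a rough illustration only: it approximates the partial derivatives by central differences, and it uses a two-dimensional isotropic oscillator and the quantity $x p_y - y p_x$ as hypothetical test functions that do not appear in the text. The helper names are likewise assumptions.

```python
def poisson_bracket(f, g, q, p, h=1e-5):
    """Approximate {f, g} = sum_k (df/dq_k * dg/dp_k - df/dp_k * dg/dq_k).

    f and g are callables of (q, p), where q and p are lists of floats.
    Partial derivatives are taken by central differences with step h.
    """
    def d_dq(func, k):
        q_plus, q_minus = list(q), list(q)
        q_plus[k] += h
        q_minus[k] -= h
        return (func(q_plus, p) - func(q_minus, p)) / (2.0 * h)

    def d_dp(func, k):
        p_plus, p_minus = list(p), list(p)
        p_plus[k] += h
        p_minus[k] -= h
        return (func(q, p_plus) - func(q, p_minus)) / (2.0 * h)

    return sum(d_dq(f, k) * d_dp(g, k) - d_dp(f, k) * d_dq(g, k)
               for k in range(len(q)))


def H(q, p, m=1.0, k=1.0):
    # Assumed test Hamiltonian: isotropic oscillator in the plane.
    return (p[0] ** 2 + p[1] ** 2) / (2.0 * m) + 0.5 * k * (q[0] ** 2 + q[1] ** 2)


def L_z(q, p):
    # Assumed test function: x*p_y - y*p_x.
    return q[0] * p[1] - q[1] * p[0]


if __name__ == "__main__":
    q, p = [0.3, -1.2], [0.7, 0.4]
    print(poisson_bracket(lambda q, p: q[0], lambda q, p: p[0], q, p))  # close to 1
    print(poisson_bracket(lambda q, p: q[0], lambda q, p: p[1], q, p))  # close to 0
    print(poisson_bracket(L_z, H, q, p))                                # close to 0
```

The last value being close to zero is consistent with the criterion (31.6): for this test Hamiltonian, $x p_y - y p_x$ does not depend explicitly on the time and is an integral of motion.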

A very important property of the Poisson bracket is its invariance
relative to canonical transformations. This signifies that

$$\{\varphi, \psi\}_{q,p} = \{\varphi, \psi\}_{Q,P} \tag{31.17}$$

where Q and P are variables obtained from q and p with the aid
of canonical transformations.

In quantum mechanics, we shall acquaint ourselves with the quantum
Poisson bracket that is the quantum mechanical analogue of
the classical Poisson bracket treated in this section.

32. The Hamilton-Jacobi Equation

Variation of the action

$$S = \int_{t_1}^{t_2} L(q_k, \dot q_k, t)\,dt \tag{32.1}$$

in finding the true trajectory of motion of a system (we have in
mind the trajectory in configuration space, i.e. in a space with s
dimensions; s is the number of degrees of freedom) consists in
comparing the values of S for close trajectories with fixed ends, i.e. with
identical values of $q_k(t_1) = q_k^{(1)}$ and $q_k(t_2) = q_k^{(2)}$. This can be
illustrated with the aid of Fig. 32.1. Only the trajectory for which S
is minimum corresponds to actual motion (it is depicted by a solid
line in the figure).

In the present section, we shall consider the action S as a quantity
characterizing motion along true trajectories, and study the
behaviour of this quantity upon changes in the point $q^{(2)}$ (with
$t_2 = \text{const}$), and also upon changes in $t_2$ [the symbol $q^{(2)}$ signifies the
set of all the $q_k^{(2)}$’s]. We shall thus treat the action as the function

$$S = S(q_k, t) \tag{32.2}$$

where $q_k$ are the coordinates of the final position of the system, and t
is the instant when this position is reached.

Let us take near the point $q^{(2)}$ a point with the coordinate
$q^{(2)} + \delta q$ which the system reaches at the same instant $t_2$ in which it
arrives at the point $q^{(2)}$ (Fig. 32.2). The action for the trajectory
bringing the system to the point $q^{(2)} + \delta q$ differs from the action for the
trajectory bringing the system to the point $q^{(2)}$ by the quantity

$$\delta S = \int_{t_1}^{t_2} \sum_k \Big( \frac{\partial L}{\partial q_k}\,\delta q_k + \frac{\partial L}{\partial \dot q_k}\,\delta\dot q_k \Big)\,dt \tag{32.3}$$

Here $\delta q_k$ is the difference between the values of $q_k$ taken for both
trajectories at the same instant t; similarly, $\delta\dot q_k$ is the difference
between the values of $\dot q_k$ at the instant t.

Let us integrate the second term in (32.3) by parts:

$$\int_{t_1}^{t_2} \frac{\partial L}{\partial \dot q_k}\,\delta\dot q_k\,dt = \frac{\partial L}{\partial \dot q_k}\,\delta q_k \Big|_{t_1}^{t_2} - \int_{t_1}^{t_2} \frac{d}{dt}\Big( \frac{\partial L}{\partial \dot q_k} \Big)\,\delta q_k\,dt \tag{32.4}$$

For the true trajectory, $\partial L/\partial \dot q_k$ is the generalized momentum $p_k$.
The origins of both trajectories coincide, hence $\delta q_k(t_1) = 0$. The
quantity $\delta q_k(t_2)$ can be designated simply by $\delta q_k$. Consequently,
the first term in (32.4) can be represented in the form $p_k\,\delta q_k$.

Let us introduce (32.4) into expression (32.3):

$$\delta S = \sum_k p_k\,\delta q_k + \int_{t_1}^{t_2} \sum_k \Big( \frac{\partial L}{\partial q_k} - \frac{d}{dt}\frac{\partial L}{\partial \dot q_k} \Big)\,\delta q_k\,dt$$

True trajectories satisfy Lagrange’s equations. Therefore the
integrand and, consequently, the integral itself, vanishes. We have thus
obtained the following value for the increment of the action S due
to the change in the coordinates of the final position of the system
by $\delta q_k$ (at a constant time of motion):

$$\delta S = \sum_k p_k\,\delta q_k \tag{32.5}$$

Here $p_k$ is the value of the momentum at the instant $t_2$.

It follows from expression (32.5) that

$$\frac{\partial S}{\partial q_k} = p_k \tag{32.6}$$

Consequently, the partial derivatives of the action with respect
to the generalized coordinates equal the corresponding generalized
momenta.

Now assume that the upper limit of integration in (32.1) is not
fixed. To underline this fact, we shall write the action in the form

$$S = \int_{t_1}^{t} L\,dt \tag{32.7}$$

The action represented in this way is a function of the upper
integration limit, i.e. $S = S(t)$. It can be seen from (32.7) that

$$\frac{dS}{dt} = L \tag{32.8}$$

At the same time, in accordance with (32.2), we can write that

$$\frac{dS}{dt} = \frac{\partial S}{\partial t} + \sum_k \frac{\partial S}{\partial q_k}\,\dot q_k = \frac{\partial S}{\partial t} + \sum_k p_k \dot q_k \tag{32.9}$$

[we have taken relation (32.6) into account]. Equating the
right-hand sides of expressions (32.8) and (32.9), we get the following
value for the partial derivative of S with respect to t:

$$\frac{\partial S}{\partial t} = -\Big( \sum_k p_k \dot q_k - L \Big)$$

The expression in parentheses is the Hamiltonian H. Hence,

$$\frac{\partial S}{\partial t} = -H(q_k, p_k, t) \tag{32.10}$$

In accordance with formulas (32.6) and (32.10), the differential
of function (32.2) can be written as

$$dS = \sum_k p_k\,dq_k - H\,dt \tag{32.11}$$

Let us substitute for the $p_k$’s in Eq. (32.10) their values from
(32.6) and write this equation as follows:

$$\frac{\partial S}{\partial t} + H\Big( q_1, q_2, \ldots, q_s;\ \frac{\partial S}{\partial q_1}, \frac{\partial S}{\partial q_2}, \ldots, \frac{\partial S}{\partial q_s};\ t \Big) = 0 \tag{32.12}$$

We have obtained a differential equation that must be satisfied
by the function $S(q_1, q_2, \ldots, q_s; t)$. It is called the
Hamilton-Jacobi equation. It is a first-order partial differential
equation.

Equation (32.12) is the cornerstone of a general method of integrat- 
ing equations of motion. But a treatment of this method is beyond 
the scope of our course. 

For a conservative system with stationary constraints, the time
is not contained explicitly in the function H, and $H = E = \text{const}$
[see (30.9)]. Consequently, according to (32.10), the dependence of S
on t is expressed by the term $-Et$. Therefore, the action breaks
up into two terms, one of which depends only on the generalized
coordinates, and the other only on the time:

$$S(q_k, t) = S_0(q_k) - Et \tag{32.13}$$

The function $S_0(q_k)$ is called the contracted action. Introducing S
in the form of (32.13) into Eq. (32.12), we arrive at the
Hamilton-Jacobi equation for contracted action:

$$H\Big( q_1, q_2, \ldots, q_s;\ \frac{\partial S_0}{\partial q_1}, \frac{\partial S_0}{\partial q_2}, \ldots, \frac{\partial S_0}{\partial q_s} \Big) = E \tag{32.14}$$
The Hamilton-Jacobi equation plays an important role in optics
and quantum mechanics. It underlies the optical-mechanical analogy,
which led E. Schrödinger to the formulation of wave mechanics.

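Let us write the Hamilton-Jacobi equation for a particle moving
in a non-stationary potential field. Taking into account formulas
(30.3) and (32.12), we obtain

$$\frac{1}{2m}\Big\{ \Big( \frac{\partial S}{\partial x} \Big)^2 + \Big( \frac{\partial S}{\partial y} \Big)^2 + \Big( \frac{\partial S}{\partial z} \Big)^2 \Big\} + U(x, y, z, t) = -\frac{\partial S}{\partial t} \tag{32.15}$$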
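Equation (32.15) can be tried out on the simplest case. The sketch below rests on an assumption not stated in the text: for a free particle (U = 0) in one dimension that leaves the origin at t = 0, the action along the true trajectory is taken to be $S = mx^2/2t$, a standard closed form. The code checks by central differences that this S satisfies the equation and that $\partial S/\partial x$ reproduces the momentum, as (32.6) requires. Function names and numbers are illustrative.

```python
def action(x, t, m=1.0):
    """Assumed free-particle action S = m*x^2/(2*t) for a start at x = 0, t = 0."""
    return m * x * x / (2.0 * t)


def hj_residual(x, t, m=1.0, h=1e-5):
    """dS/dt + (dS/dx)^2/(2m), which Eq. (32.15) with U = 0 requires to vanish."""
    dS_dt = (action(x, t + h, m) - action(x, t - h, m)) / (2.0 * h)
    dS_dx = (action(x + h, t, m) - action(x - h, t, m)) / (2.0 * h)
    return dS_dt + dS_dx * dS_dx / (2.0 * m)


def momentum(x, t, m=1.0, h=1e-5):
    """p = dS/dx, Eq. (32.6), evaluated by a central difference."""
    return (action(x + h, t, m) - action(x - h, t, m)) / (2.0 * h)


if __name__ == "__main__":
    m, x, t = 1.5, 3.0, 2.0
    print(hj_residual(x, t, m))        # close to zero
    print(momentum(x, t, m), m * x / t)  # both close to m*v with v = x/t
```

The second line of output compares $\partial S/\partial x$ with $mv$, where $v = x/t$ is the constant velocity of the free particle.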

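If the field in which the particle is moving is stationary, instead of
(32.15), an equation for the contracted action $S_0$ is considered:

$$\frac{1}{2m}\Big\{ \Big( \frac{\partial S_0}{\partial x} \Big)^2 + \Big( \frac{\partial S_0}{\partial y} \Big)^2 + \Big( \frac{\partial S_0}{\partial z} \Big)^2 \Big\} + U(x, y, z) = E$$
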

Chapter VII 


THE SPECIAL THEORY 
OF RELATIVITY 


33. The Principle of Relativity 

The special theory of relativity is based on two postulates formu- 
lated by Albert Einstein: 

1. All laws of nature are the same in all inertial reference frames.
In other words, we can say that the equations expressing the laws of
nature are invariant¹ with respect to transformations of coordinates and
time from one inertial reference frame to another.

2. Light always propagates in a vacuum at a definite constant speed c 
not depending on the state of motion of the emitting body. 

The first postulate is called Einstein’s principle of relativity, and 
the second is called the principle of constancy of the speed of light. 

Newtonian mechanics proceeds from the assumption that inter- 
actions are transmitted instantaneously from body to body. This, 
particularly, manifests itself in that the interaction of particles is 
described with the aid of the potential energy $U(\mathbf{r}_1, \mathbf{r}_2, \ldots)$,
which depends only on the coordinates of the particles. It is thus 
assumed that a change in the position of one of the particles affects 
the other particles at the very same instant. 

Actually, as shown by experiments, there are no instantaneous 
interactions in nature. If the position of a particle changes, this 
change begins to tell on another particle interacting with it after 
a finite interval of time elapses, needed for the interaction propagat- 
ing at a finite velocity to cover a path equal to the distance between 
the particles. Consequently, we must acknowledge the existence of 
a maximum velocity of propagation of interactions. Experiments 
show that this velocity equals c, the speed of light in a vacuum.

It also follows from the second postulate that the velocity of 
propagation of interactions is the same in all inertial reference 
frames, i.e. is a universal constant. 

In accordance with Galileo’s mechanical principle of relativity,
the laws of mechanics are invariant relative to the Galilean
transformations:

$$x = x' + v_0 t', \quad y = y', \quad z = z', \quad t = t' \tag{33.1}$$

($\mathbf{v}_0$ is the velocity of the frame K' relative to the frame K). These
transformations lead to the classical law of velocity summation:

$$\mathbf{v} = \mathbf{v}' + \mathbf{v}_0 \tag{33.2}$$

The latter equation and, consequently, Eqs. (33.1) from which
it follows, do not agree with Einstein’s second postulate, according
to which c = c' for a light signal.

¹ The invariance of an equation signifies that the form of the equation
does not change when the coordinates and time of one reference frame are
replaced in it with the coordinates and time of another frame.

Consider two inertial reference frames K and K'. We shall select
the coordinate axes of these frames so that the axes x and x' are
directed along the velocity $\mathbf{v}_0$ of the frame K', and the axes y and z
are parallel to the axes y' and z' (Fig. 33.1). We shall begin to
measure the time in both frames from the instant when the origins
of the systems coincide. Assume that a light signal propagating
in all directions was sent at the instant t = t' = 0 from the
coinciding origins of coordinates. By the instant t, the signal in the frame K
will reach points at the distance l = ct from O. The coordinates of
these points satisfy the equation

$$c^2 t^2 - x^2 - y^2 - z^2 = 0 \tag{33.3}$$

Similarly, by the instant t', the signal in the frame K' will reach
points of a sphere of the radius ct'. The coordinates of these points
satisfy the equation

$$c^2 t'^2 - x'^2 - y'^2 - z'^2 = 0 \tag{33.4}$$

Equations (33.3) and (33.4) have the same form, which manifests
the invariance of the law of propagation of light with respect to
a transformation of coordinates and time from one frame to another.
If we introduce into (33.3) the values of the unprimed coordinates
and time determined by formulas (33.1), we get the relation

$$c^2 t'^2 - x'^2 - y'^2 - z'^2 - 2 v_0 x' t' - v_0^2 t'^2 = 0$$

not coinciding with (33.4). Hence, we have again arrived at the
conclusion that the Galilean transformations are not compatible
with the principle of the constancy of the speed of light.

According to Einstein’s principle of relativity, all the laws of
nature, including the laws of mechanics and electrodynamics, must
be invariant relative to the same transformations of the coordinates
and time performed in passing from one reference frame to another.
But Newton’s equations and Maxwell’s equations do not meet this
requirement. Whereas Newton’s equations are invariant with respect
to the Galilean transformations, Maxwell’s equations, as can readily
be seen by direct verification, are not invariant relative to these
transformations. This circumstance led Einstein to the conclusion
that Newton’s equations need refinement, as a result of which the
laws of mechanics and electrodynamics would be invariant with
respect to the same transformations. The required modification of
the laws of mechanics was performed by Einstein. The result was
the appearance of mechanics agreeing with Einstein’s principle of
relativity, which was given the name of relativistic mechanics. The
present chapter is devoted to a treatment of the fundamentals of
this mechanics.
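The incompatibility of the Galilean transformations (33.1) with the form (33.4) can be confirmed with a few lines of arithmetic. The sketch below is an illustration with arbitrarily chosen numbers (units with c = 1, an event on the light sphere in K', and $v_0 = 0.5$); the function names are assumptions made for the example.

```python
def light_cone_form(t, x, y, z, c=1.0):
    """Left-hand side of Eq. (33.3): c^2 t^2 - x^2 - y^2 - z^2."""
    return c * c * t * t - x * x - y * y - z * z


def galilean_to_unprimed(t_p, x_p, y_p, z_p, v0):
    """Galilean transformations (33.1): returns (t, x, y, z) in the frame K."""
    return t_p, x_p + v0 * t_p, y_p, z_p


if __name__ == "__main__":
    # An event reached by the light signal in K' (c = 1): 5^2 - 3^2 - 4^2 = 0.
    t_p, x_p, y_p, z_p = 5.0, 3.0, 4.0, 0.0
    v0 = 0.5

    print(light_cone_form(t_p, x_p, y_p, z_p))            # zero in K'
    residual = light_cone_form(*galilean_to_unprimed(t_p, x_p, y_p, z_p, v0))
    print(residual)                                        # not zero in K
    print(-2.0 * v0 * x_p * t_p - v0 * v0 * t_p * t_p)     # the extra terms
```

The last two printed numbers coincide: the residual in K is exactly the pair of extra terms $-2v_0x't' - v_0^2t'^2$ that prevents the transformed relation from coinciding with (33.4).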

34. Interval

An event occurring with a particle is characterized by the place
where it occurred (i.e. a set of values of x, y, z) and the time t
when it occurred. If we introduce an imaginary four-dimensional
space along whose axes we lay off the space coordinates x, y, z
and the time t (or a quantity proportional to t), an event will be
characterized in this space by a point. A point depicting an event
in a four-dimensional space is called a world point. With time,
a world point corresponding to a given particle moves in
four-dimensional space, describing a line known as a world line.

Let us consider two events, the first of which consists in the
emission of a light signal from a point with the coordinates $x_1$, $y_1$, $z_1$
at the instant $t_1$, and the second in the arrival of this signal at a point
with the coordinates $x_2$, $y_2$, $z_2$ at the instant $t_2$. The following
relation holds between the coordinates and time of these two events:

$$c^2 (t_2 - t_1)^2 - (x_2 - x_1)^2 - (y_2 - y_1)^2 - (z_2 - z_1)^2 = 0 \tag{34.1}$$

The quantity

$$l_{12}^2 = (x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2 \tag{34.2}$$

is the square of the distance (or of the interval) between two points
in conventional space. We can speak in a similar way about the
distance (interval) between two points in a four-dimensional space.
The interval between two events is defined to be the quantity $s_{12}$
whose square is determined by the formula

$$s_{12}^2 = c^2 (t_2 - t_1)^2 - (x_2 - x_1)^2 - (y_2 - y_1)^2 - (z_2 - z_1)^2 = c^2 (t_2 - t_1)^2 - l_{12}^2 \tag{34.3}$$






For two infinitely close events, the square of the interval is

$$ds^2 = c^2\,dt^2 - dl^2 \tag{34.4}$$

For two events consisting in the emission of a light signal at one
point and its arrival at another point, the interval is zero:

$$\Delta s^2 = c^2\,\Delta t^2 - \Delta l^2 = 0 \tag{34.5}$$

[see formula (34.1)]. Owing to the constancy of the speed of light,
Eq. (34.1) must hold in any inertial reference frame. Consequently,
if an interval equals zero in the frame K, it will be zero in any other
frame K'.

Hence, an interval must become equal to zero in all reference
frames simultaneously. It thus follows that the interval $\Delta s$ between
events expressed in the frame K must be related to the interval $\Delta s'$
between the same events expressed in the frame K' by the equation

$$\Delta s = a\,\Delta s' \tag{34.6}$$

But owing to the complete equivalence of the frames K and K', we
can write on the same grounds that

$$\Delta s' = a\,\Delta s \tag{34.7}$$

where a has the same value as in formula (34.6).

Multiplying relations (34.6) and (34.7), we find that

$$a^2 = 1$$

whence $a = \pm 1$. It is natural to assume that the sign of the interval
in all the reference frames must be the same. Therefore the value of a
equal to $-1$ must be discarded. We thus arrive at the conclusion that
the interval between two events is an invariant:

$$\Delta s = \Delta s' \tag{34.8}$$

The result we have obtained indicates the expediency of the
definition of the interval between two points of four-dimensional space
that we have adopted. The interval determined by formula (34.3)
is invariant with respect to transformations of the coordinates and
time from one reference frame to another, i.e. behaves like the
distance (34.2) between two points in conventional space.

We must underline the fact that the conclusion on the invariance 
of quantity (34.3) is a logical corollary of Einstein’s postulates. 
Proceeding from the invariance of the interval, we can write that 

$$\Delta s^2 = c^2\,\Delta t^2 - \Delta l^2 = c^2\,\Delta t'^2 - \Delta l'^2 \tag{34.9}$$

Assume that $\Delta s^2 > 0$, i.e. that the interval is real. We can therefore 
find a reference frame K' in which $\Delta l'$ equals zero. In this 
frame, events separated by the interval $\Delta s$ occur at one point. 
The time interval between the events in the frame K' is 

$$\Delta t' = \frac{\Delta s}{c} \tag{34.10}$$

Real intervals are called time-like. 

Now assume that $\Delta s^2 < 0$, i.e. that the interval is imaginary. 
We can therefore find a reference frame K' in which $\Delta t' = 0$, 
i.e. the events occur simultaneously. The distance between the points 
at which the events occurred in the frame K' is 

$$\Delta l' = i\,\Delta s \tag{34.11}$$


Imaginary intervals are called space-like. 

Events occurring with the same particle can be separated only by 
a time-like interval. Indeed, since a particle cannot move at a velocity 
greater than $c$, the distance $\Delta l$ which it travels during the time 
$\Delta t$ cannot exceed $c\,\Delta t$, i.e. $\Delta l \le c\,\Delta t$. Hence, $\Delta s^2 \ge 0$. 

A space-like interval can separate only events having no causal 
relation. Indeed, if $\Delta s^2 < 0$, then $\Delta l > c\,\Delta t$. Consequently, no 
action emerging from one point of space can reach another point 
during the time $\Delta t$ and affect an event occurring at this point. 
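
The classification of intervals given above can be illustrated numerically. The following is only a minimal sketch in Python: the function names `interval_squared` and `classify` and the sample values are our own illustrative choices, and the label "light-like" for a zero interval is the customary term rather than one introduced in the text.

```python
C = 299_792_458.0  # speed of light in m/s


def interval_squared(dt, dl, c=C):
    """Square of the interval, ds^2 = c^2 dt^2 - dl^2, as in formula (34.3)."""
    return (c * dt) ** 2 - dl ** 2


def classify(dt, dl, c=C):
    """Label a pair of events by the sign of the squared interval."""
    s2 = interval_squared(dt, dl, c)
    if s2 > 0:
        return "time-like"
    if s2 < 0:
        return "space-like"
    return "light-like"  # zero interval, the case of formula (34.5)


# Events 1 s apart in time and 1e8 m apart in space: a signal can connect them.
print(classify(1.0, 1.0e8))
# Events 1 s apart in time and 4e8 m apart in space: no signal can connect them.
print(classify(1.0, 4.0e8))
```

In the second call the distance exceeds $c\,\Delta t$, so no causal relation between the two events is possible.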

Consider a particle moving uniformly at the velocity v relative 
to a frame K (a laboratory frame). Assume that two events occur 
with this particle and are separated by a time interval equal to dt 
in the frame K. Let us introduce a frame K' relative to which the 
particle is at rest. In this frame, the time interval between the events 
being considered is 

$$dt' = \frac{ds}{c}$$

[see (34.10)]. 

It is a simple matter to see that the time interval $dt'$ is measured 
by a clock moving together with the particle relative to K. The time 
measured by means of a clock moving together with a body is called 
the proper time of the body. Denoting the proper time by the symbol 
$\tau$, we can write 

$$d\tau = \frac{ds}{c} \tag{34.12}$$

Since $ds$ is an invariant, and $c$ is a constant, the proper time $d\tau$ is 
an invariant. 

Let us find the relation between the proper time $d\tau$ and the time 
$dt$ measured by means of a clock belonging to the frame K relative 
to which the particle and the (proper) clock associated with it move 
at the velocity $v$. For this purpose, we shall introduce into (34.12) 
the expression for $ds$ in terms of the coordinates and the time in the 
frame K: 

$$d\tau = \frac{ds}{c} = \frac{\sqrt{c^2\,dt^2 - dl^2}}{c}$$

[see formula (34.4)]. Let us transform the expression obtained as 
follows: 

$$d\tau = dt\,\sqrt{1 - \frac{1}{c^2}\left(\frac{dl}{dt}\right)^2}$$

But $dl/dt$ is the velocity $v$ of the particle. Hence, 

$$d\tau = dt\,\sqrt{1 - \frac{v^2}{c^2}} \tag{34.13}$$

We conclude from (34.13) that the proper time of a moving particle is 
always less than the relevant time interval in the stationary (laboratory) 
frame. 

We have obtained formula (34.13) for the uniform motion of a 
particle. It is also valid for non-uniform motion. We can therefore 
write the following equation for finite time intervals: 

$$\Delta\tau = \int_{t_1}^{t_2} \sqrt{1 - \frac{v^2}{c^2}}\,dt \tag{34.14}$$

where $v = v(t)$ is the velocity of the body for which the proper time 
is being calculated. 
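
Formula (34.14) lends itself to direct numerical evaluation. The sketch below approximates the integral by the midpoint rule; the function name `proper_time`, its parameters `v_of_t` and `steps`, and the two sample velocity laws are assumptions made purely for illustration.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def proper_time(v_of_t, t1, t2, steps=10_000, c=C):
    """Approximate the integral (34.14) by the midpoint rule."""
    h = (t2 - t1) / steps
    total = 0.0
    for k in range(steps):
        t = t1 + (k + 0.5) * h          # midpoint of the k-th subinterval
        v = v_of_t(t)
        total += math.sqrt(1.0 - (v / c) ** 2) * h
    return total


# Uniform motion at 0.6c during 10 s of laboratory time.
uniform = proper_time(lambda t: 0.6 * C, 0.0, 10.0)
# Velocity growing linearly from 0 to 0.6c during the same 10 s.
accelerating = proper_time(lambda t: 0.06 * C * t, 0.0, 10.0)
print(uniform, accelerating)
```

For uniform motion the result agrees with (34.13), i.e. with $10\sqrt{1 - 0.36} = 8$ s; for the accelerated motion the proper time lies between this value and the laboratory time of 10 s.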

35. Lorentz Transformations 

We established in the preceding section that the interval $\Delta s$ between 
two points in four-dimensional space is an invariant, i.e. 
behaves like the magnitude of a vector in Euclidean space. This 
gives us grounds to consider $\Delta s$ as the magnitude (“length”) of 
a four-dimensional vector (a four-vector) drawn from one world 
point to another. 

If we introduce the notation 

$$x^0 = ct, \quad x^1 = x, \quad x^2 = y, \quad x^3 = z \tag{35.1}$$

the square of the interval becomes 

$$\Delta s^2 = (\Delta x^0)^2 - (\Delta x^1)^2 - (\Delta x^2)^2 - (\Delta x^3)^2$$

The following relation holds for the distance $\Delta l$ between two points 
in Euclidean space: 

$$\Delta l^2 = |\mathbf{r}_2 - \mathbf{r}_1|^2 = \Delta x_1^2 + \Delta x_2^2 + \Delta x_3^2$$

i.e. $\Delta l$ equals the magnitude of the difference of the points’ position 
vectors. Similarly, the interval $\Delta s$ can be represented as the magnitude 
of the difference of the four-position vectors of the relevant world 
points. Consequently, the coordinates $x^0, x^1, x^2, x^3$ are the components 
of the four-position vector of a world point. The square of the 
magnitude of this position vector is 

$$(x^0)^2 - (x^1)^2 - (x^2)^2 - (x^3)^2$$


A comparison of the last expression with formula (XII. 5) shows that 
the space in which an event is depicted by a world point with the 
coordinates (35.1) has a pseudo-Euclidean metric determined by the 
tensor (XII. 4). 

Consequently, the square of the four-position vector can be represented 
as 

$$x^0 x_0 + x^1 x_1 + x^2 x_2 + x^3 x_3 = \sum_{\mu=0}^{3} x^\mu x_\mu \tag{35.2}$$

[see formula (XII.31)]. 

The components of a four-position vector are transformed by the 
formula 

$$x'^\mu = \sum_{\nu=0}^{3} \alpha^\mu_\nu x^\nu \tag{35.3}$$


Let us take two inertial reference frames as the coordinate systems 
K and K' in a pseudo-Euclidean space. We shall direct the axes of 
these frames in accordance with Fig. 33.1. Hence, as is established 
in Appendix XII, the matrix of the transformation coefficients is 
as follows [see (XII.21)]: 

$$[\alpha^\mu_\nu] = \begin{pmatrix} \alpha_0 & \alpha_1 & 0 & 0 \\ \alpha_1 & \alpha_0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \tag{35.4}$$

while 

$$\alpha_0^2 - \alpha_1^2 = 1 \tag{35.5}$$

[see formula (XII.22)]. 

The coefficients $\alpha_0$ and $\alpha_1$ can depend only on the relative velocity 
$v_0$ of the frames. To find the form of this relation, let us write formula 
(35.3) for $x'^1$. With a view to (35.4), we obtain 

$$x'^1 = \alpha_1 x^0 + \alpha_0 x^1$$

We replace $x'^1$ with $x'$, $x^0$ with $ct$, and $x^1$ with $x$ [see (35.1)]: 

$$x' = \alpha_1 ct + \alpha_0 x \tag{35.6}$$

Let us write the expression obtained for point O', the origin of 
coordinates of the frame K' (see Fig. 33.1). For this point, $x' = 0$ 
and $x = v_0 t$. Substitution into (35.6) yields 

$$0 = \alpha_1 ct + \alpha_0 v_0 t$$

whence 

$$\alpha_1 = -\alpha_0 \frac{v_0}{c} \tag{35.7}$$

Introducing this value of $\alpha_1$ into relation (35.5), we find that 

$$\alpha_0^2 - \alpha_0^2 \frac{v_0^2}{c^2} = 1$$

or 

$$\alpha_0 = \frac{1}{\sqrt{1 - v_0^2/c^2}} \tag{35.8}$$

We obtain from (35.7) that 

$$\alpha_1 = -\frac{v_0/c}{\sqrt{1 - v_0^2/c^2}} \tag{35.9}$$
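Hence, the matrix (35.4) in the case we are interested in will be 

$$[\alpha^\mu_\nu] = \begin{pmatrix} \dfrac{1}{\sqrt{1-\beta^2}} & \dfrac{-\beta}{\sqrt{1-\beta^2}} & 0 & 0 \\ \dfrac{-\beta}{\sqrt{1-\beta^2}} & \dfrac{1}{\sqrt{1-\beta^2}} & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \tag{35.10}$$

We have introduced the symbol 

$$\beta = \frac{v_0}{c} \tag{35.11}$$

The matrix of the reverse transformation differs from (35.10) 
only in that $\beta$ in the numerator is preceded by a plus sign [see the 
matrix (XII.23)]. 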

Substitution of the values of $\alpha^\mu_\nu$ we have found into (35.3) leads 
to formulas for the transformation of the components of a four-position 
vector: 

$$x'^0 = \frac{x^0 - \beta x^1}{\sqrt{1-\beta^2}}, \quad x'^1 = \frac{-\beta x^0 + x^1}{\sqrt{1-\beta^2}}, \quad x'^2 = x^2, \quad x'^3 = x^3 \tag{35.12}$$

The formulas for the inverse transformation are 

$$x^0 = \frac{x'^0 + \beta x'^1}{\sqrt{1-\beta^2}}, \quad x^1 = \frac{\beta x'^0 + x'^1}{\sqrt{1-\beta^2}}, \quad x^2 = x'^2, \quad x^3 = x'^3 \tag{35.13}$$

Going over in formulas (35.12) and (35.13) to the conventional 
symbols t, x, y, z, we obtain 

$$t' = \frac{t - (v_0/c^2)\,x}{\sqrt{1-\beta^2}}, \quad x' = \frac{x - v_0 t}{\sqrt{1-\beta^2}}, \quad y' = y, \quad z' = z \tag{35.14}$$

$$t = \frac{t' + (v_0/c^2)\,x'}{\sqrt{1-\beta^2}}, \quad x = \frac{x' + v_0 t'}{\sqrt{1-\beta^2}}, \quad y = y', \quad z = z' \tag{35.15}$$

Formulas (35.14) and (35.15) are known as the Lorentz transformations. 
We invite our reader to convince himself that these transformations 
leave the interval $s_{12}$ between two events invariant. 
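
The invariance of the interval under the Lorentz transformations can also be checked numerically. The following sketch does so for one pair of events, in units in which $c = 1$; the function names `lorentz` and `interval_squared`, the auxiliary variable `gamma`, and the sample events are our own illustrative assumptions.

```python
import math


def lorentz(t, x, v0, c=1.0):
    """Direct transformation (35.14) for the coordinates t and x."""
    gamma = 1.0 / math.sqrt(1.0 - (v0 / c) ** 2)
    return gamma * (t - v0 * x / c ** 2), gamma * (x - v0 * t)


def interval_squared(event_a, event_b, c=1.0):
    """Squared interval (34.3) between two events given as (t, x) pairs."""
    (ta, xa), (tb, xb) = event_a, event_b
    return (c * (tb - ta)) ** 2 - (xb - xa) ** 2


a, b = (0.0, 0.0), (5.0, 3.0)                    # two events in the frame K
a2, b2 = lorentz(*a, 0.6), lorentz(*b, 0.6)      # the same events in the frame K'
print(interval_squared(a, b), interval_squared(a2, b2))

# The inverse transformation (35.15) is the direct one with the sign of v0 reversed.
b_back = lorentz(*b2, -0.6)
print(b_back)
```

With these sample values the two events are separated by a time-like interval, and the frame K' moving at $v_0 = 0.6c$ is the one in which they occur at the same point.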

The formulas for the inverse transformation (35.15) differ from those 
for the direct transformation (35.14) only in the sign of v 0 . This 
should be expected with a view to the total equivalence of both 
reference frames, and also to the circumstance that for the given K 
and K' the projections of the velocity of relative motion onto the 
axes x and x' differ in their signs. Indeed, if the velocity of the 
frame K' relative to the frame K is directed to the right (and its 
projection onto the x-axis is positive), the velocity of the frame K 
relative to K' is directed to the left (and its projection onto the 
x'-axis is negative). 

At velocities $v_0$ so small that the ratio $v_0/c$ may be disregarded in 
comparison with unity, it is easy to see that the Lorentz transformations 
change into the Galilean ones. 

The Lorentz transformations allow us to obtain formulas for the 
transformation of the lengths and time intervals in passing from 
one inertial reference frame to another (this is done in any general 
course of physics). We shall limit ourselves to recalling the formula 
for the Lorentz contraction of a body’s length (in the direction of 
its motion): 

$$l = l_0\sqrt{1 - \frac{v^2}{c^2}} \tag{35.16}$$

Here $l_0$ is the proper length of the body, for instance a rod (i.e. the 
length of the body in the reference frame in which it is at rest), and $l$ 
is the length of the body in the reference frame relative to which 
it is moving at the velocity $v$. 

Since the lateral dimensions of a body do not change during its 
motion, the volume of the body contracts in accordance with the 
formula 

$$V = V_0\sqrt{1 - \frac{v^2}{c^2}} \tag{35.17}$$

where $V_0$ is the proper volume of the body, and $V$ is its volume in 
the frame relative to which it is moving at the velocity $v$. 

Let us find formulas for the transformation of the components of 
a particle’s velocity. From formulas (35.14), we have 

$$dt' = \frac{dt - (v_0/c^2)\,dx}{\sqrt{1-\beta^2}}, \quad dx' = \frac{dx - v_0\,dt}{\sqrt{1-\beta^2}}, \quad dy' = dy, \quad dz' = dz$$

Consequently, 

$$v'_x = \frac{dx'}{dt'} = \frac{dx - v_0\,dt}{dt - (v_0/c^2)\,dx} = \frac{dx/dt - v_0}{1 - (v_0/c^2)(dx/dt)} = \frac{v_x - v_0}{1 - v_0 v_x/c^2}$$

Therefore, 

$$v'_x = \frac{v_x - v_0}{1 - v_0 v_x/c^2} \tag{35.18}$$

Similar calculations lead to formulas for transforming the two other 
velocity components: 

$$v'_y = \frac{v_y\sqrt{1-\beta^2}}{1 - v_0 v_x/c^2}, \quad v'_z = \frac{v_z\sqrt{1-\beta^2}}{1 - v_0 v_x/c^2} \tag{35.19}$$

The formulas for the inverse transformation are 

$$v_x = \frac{v'_x + v_0}{1 + v_0 v'_x/c^2}, \quad v_y = \frac{v'_y\sqrt{1-\beta^2}}{1 + v_0 v'_x/c^2}, \quad v_z = \frac{v'_z\sqrt{1-\beta^2}}{1 + v_0 v'_x/c^2} \tag{35.20}$$
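
The velocity transformation formulas can be tried out on a few sample values. The sketch below, in units in which $c = 1$, is only illustrative: the function names `transform_velocity` and `inverse_velocity` and the chosen velocities are our own assumptions.

```python
import math


def transform_velocity(vx, vy, vz, v0, c=1.0):
    """Formulas (35.18) and (35.19): velocity components in K' from those in K."""
    denom = 1.0 - v0 * vx / c ** 2
    root = math.sqrt(1.0 - (v0 / c) ** 2)
    return (vx - v0) / denom, vy * root / denom, vz * root / denom


def inverse_velocity(vx_p, vy_p, vz_p, v0, c=1.0):
    """Formulas (35.20): the same expressions with the sign of v0 reversed."""
    return transform_velocity(vx_p, vy_p, vz_p, -v0, c)


# A particle moving at 0.9c in K', while K' itself moves at 0.9c relative to K.
combined = inverse_velocity(0.9, 0.0, 0.0, 0.9)[0]
# A light signal along the x-axis, observed from a frame moving at 0.5c.
light = transform_velocity(1.0, 0.0, 0.0, 0.5)[0]
# A light signal along the y-axis, observed from a frame moving at 0.6c.
side = transform_velocity(0.0, 1.0, 0.0, 0.6)
print(combined, light, math.hypot(side[0], side[1]))
```

The combined velocity remains below $c$, and the speed of the light signal remains equal to $c$ in both examples, although its direction changes in the second one.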


Assume that the velocity $\mathbf{v}$ of a particle makes the angle $\theta$ with 
the x-axis, and the angle $\theta'$ with the x'-axis (the axes x and x', 
by parallel translation, can always be brought into a position in 
which they will be in the same plane as $\mathbf{v}$). Let us find the relation 
between the angles $\theta$ and $\theta'$. We arrange the axes y and y' in the 
plane defined by the x-axis and the direction of the vector $\mathbf{v}$. The 
latter will therefore be in the plane xy, and we can write 

$$v_x = v\cos\theta, \quad v_y = v\sin\theta$$

$$v'_x = v'\cos\theta', \quad v'_y = v'\sin\theta'$$

where $v$ is the magnitude of the velocity in the frame K, and $v'$ is 
its magnitude in the frame K'. 

With the aid of formulas (35.18) and (35.19), we find that 

$$\tan\theta' = \frac{v'_y}{v'_x} = \frac{v_y\sqrt{1-\beta^2}}{v_x - v_0} = \frac{v\sin\theta\,\sqrt{1-\beta^2}}{v\cos\theta - v_0} \tag{35.21}$$

This formula allows us to find the angle $\theta'$ made by the vector $\mathbf{v}'$ 
with the x'-axis from the known values of $v$ and $\theta$. Similarly, we 
can find a formula allowing us to determine the angle $\theta$ between 
the vector $\mathbf{v}$ and the x-axis when we know $v'$ and $\theta'$. Using formulas 
(35.20), we find that 

$$\tan\theta = \frac{v_y}{v_x} = \frac{v'_y\sqrt{1-\beta^2}}{v'_x + v_0} = \frac{v'\sin\theta'\,\sqrt{1-\beta^2}}{v'\cos\theta' + v_0} \tag{35.22}$$


36. Four-Dimensional Velocity and Acceleration 

In Appendix XII, we defined a four-vector as a set of the quantities 
$a^0, a^1, a^2, a^3$ which in passing over from one system of coordinates 
to another are transformed according to the same rules as the 
components of a four-position vector. Consequently, the formulas for 
transforming the quantities $a^\mu$ are similar to formulas (35.12): 

$$a'^0 = \frac{a^0 - \beta a^1}{\sqrt{1-\beta^2}}, \quad a'^1 = \frac{-\beta a^0 + a^1}{\sqrt{1-\beta^2}}, \quad a'^2 = a^2, \quad a'^3 = a^3 \tag{36.1}$$

The formulas for the inverse transformation differ from (36.1) in 
the sign of $\beta$: 

$$a^0 = \frac{a'^0 + \beta a'^1}{\sqrt{1-\beta^2}}, \quad a^1 = \frac{\beta a'^0 + a'^1}{\sqrt{1-\beta^2}}, \quad a^2 = a'^2, \quad a^3 = a'^3 \tag{36.2}$$


Let us consider the four-vectors of velocity and acceleration. In 
non-relativistic mechanics, both the space intervals $dl$ and the time 
intervals $dt$ are assumed to be invariant. Therefore, the set of quantities 
obtained after dividing the components of the three-dimensional 
vector $d\mathbf{r}$ by the invariant $dt$ forms the three-dimensional vector $\mathbf{v}$, 
the velocity vector of a particle. Similarly, the set of quantities 
obtained after dividing the components of the vector $d\mathbf{v}$ by the 
invariant $dt$ is the acceleration vector $\mathbf{w}$. 

We have seen that actually neither $dl$ nor $dt$ is invariant. What 
is invariant is the interval $ds$ related to $dl$ and $dt$ by the expression 
$ds^2 = c^2\,dt^2 - dl^2$. The invariance of the interval made it 
possible to introduce a four-position vector with the components 
$x^0, x^1, x^2, x^3$ that is an analogue of a three-dimensional position 
vector with the components $x_1, x_2, x_3$. Let us attempt to find the 
four-dimensional analogues of the three-vectors $\mathbf{v}$ and $\mathbf{w}$. 

It is evident that the set of the four quantities $dx^\mu/dt$ does not 
have the properties of a four-vector because $dt$ is not an invariant 
and $\sum_\mu (dx^\mu/dt)(dx_\mu/dt)$ does not retain its value in Lorentz 
transformations. But we know an invariant that is a “relative” of $dt$. It is 
the proper time $d\tau = ds/c$ [see (34.12)]. Since $d\tau$ is an invariant 
(i.e. a scalar), the quantities 

$$u^\mu = \frac{dx^\mu}{d\tau} \tag{36.3}$$

have the properties of components of a four-vector. It is called the 
four-dimensional velocity (four-velocity) of a particle¹. 
Similarly, a four-vector with the components 

$$w^\mu = \frac{du^\mu}{d\tau} = c\,\frac{du^\mu}{ds} \tag{36.4}$$

is called the four-dimensional acceleration of a particle. 

¹ Einstein defined the four-velocity as a vector with the components 
$u^\mu = dx^\mu/ds$. It is obvious that the velocity defined in this way is a 
dimensionless quantity similar to $v/c$. 

Taking into account the values of $dx^\mu$, and also the circumstance 
that 

$$d\tau = dt\,\sqrt{1 - \frac{v^2}{c^2}} \tag{36.5}$$

[see (34.13)], it is a simple matter to obtain the following values for 
the components of the four-velocity: 

$$u^0 = \frac{c}{\sqrt{1 - v^2/c^2}}, \quad u^k = \frac{v_k}{\sqrt{1 - v^2/c^2}} \quad (k = 1, 2, 3) \tag{36.6}$$

which can be written as 

$$u^\mu = \left(\frac{c}{\sqrt{1 - v^2/c^2}}, \frac{\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) \tag{36.7}$$

[see formulas (XII.34) and (XII.35)]. Here $\mathbf{v}$ is the conventional 
three-dimensional velocity of a particle, and $v_k$ are its projections 
onto the axes x, y, z. Of importance for our further discussion is 
the fact that when $v \ll c$ the spatial part of the four-velocity 
transforms into the conventional velocity $\mathbf{v}$. 

We easily find from formulas (36.6) that 

$$\sum_{\mu=0}^{3} u^\mu u_\mu = c^2 \tag{36.8}$$

(if we determine $u^\mu$ in the same way as Einstein did, $\sum_\mu u^\mu u_\mu = 1$). 

Differentiating formula (36.8) with respect to $\tau$, we obtain 

$$\sum_\mu \frac{du^\mu}{d\tau}\,u_\mu + \sum_\mu u^\mu\,\frac{du_\mu}{d\tau} = \sum_\mu w^\mu u_\mu + \sum_\mu u^\mu w_\mu = 0$$

According to (XII.33), both sums are equivalent, so that 

$$\sum_\mu w^\mu u_\mu = 0 \tag{36.9}$$

It follows from Eq. (36.9) that the vectors of the four-dimensional 
velocity and acceleration are mutually perpendicular. 
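
Relations (36.8) and (36.9) can be verified numerically for a sample velocity. The sketch below works in units in which $c = 1$ and replaces the derivative in (36.4) by a small finite increment of the four-velocity; the function names `four_velocity` and `minkowski_dot` and all numerical values are our own illustrative assumptions.

```python
import math


def four_velocity(v, c=1.0):
    """Components (36.6) of the four-velocity for a three-velocity v = (vx, vy, vz)."""
    root = math.sqrt(1.0 - sum(comp ** 2 for comp in v) / c ** 2)
    return (c / root,) + tuple(comp / root for comp in v)


def minkowski_dot(a, b):
    """Sum over mu of a^mu b_mu for the metric with signature (+, -, -, -)."""
    return a[0] * b[0] - sum(x * y for x, y in zip(a[1:], b[1:]))


u = four_velocity((0.3, 0.4, 0.0))               # units in which c = 1
u_next = four_velocity((0.3 + 1e-6, 0.4, 0.0))   # the particle a moment later, slightly faster
du = tuple(b - a for a, b in zip(u, u_next))     # proportional to the four-acceleration

print(minkowski_dot(u, u))    # should be close to c**2 = 1, formula (36.8)
print(minkowski_dot(du, u))   # should be close to 0, formula (36.9)
```

The second product vanishes only to the first order in the increment, which is why a small increment of the velocity is taken.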

37. Relativistic Dynamics 

Newton’s equations are invariant with respect to the Galilean 
transformations, but are not invariant with respect to the Lorentz 
ones. Consequently, to satisfy Einstein’s principle of relativity, 
Newton’s second law must be replaced with a more general one. 
Since the Lorentz transformations convert to the Galilean transformations 
when $\beta \to 0$ (i.e. when $v_0/c \to 0$), the relativistic-invariant 
equations of motion at $v \ll c$ must convert to the 
Newtonian equations: 

$$\frac{d}{dt}(mv_i) = F_i \quad (i = 1, 2, 3) \tag{37.1}$$

The following relations are a natural four-dimensional generalization 
of these equations: 

$$\frac{d}{d\tau}(mu^\mu) = K^\mu \quad (\mu = 0, 1, 2, 3) \tag{37.2}$$

where $\tau$ is the proper time, $m$ is an invariant quantity characterizing 
the inert properties of a particle (the mass of the particle), $u^\mu$ is 
a component of the particle’s four-velocity, and, finally, $K^\mu$ is 
a four-vector known as the Minkowski force. The values of $K^\mu$ must 
be determined so that when $v \ll c$, the spatial components of 
Eqs. (37.2) transform into Eqs. (37.1), just as the spatial components 
of the four-velocity in this case transform into the conventional 
velocity $\mathbf{v}$. 

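Taking into consideration expressions (36.5) and (36.6) for $d\tau$ 
and $u^\mu$, we shall write Eqs. (37.2) as 

$$\frac{1}{\sqrt{1 - v^2/c^2}}\,\frac{d}{dt}\left(\frac{mc}{\sqrt{1 - v^2/c^2}}\right) = K^0$$

$$\frac{1}{\sqrt{1 - v^2/c^2}}\,\frac{d}{dt}\left(\frac{mv_i}{\sqrt{1 - v^2/c^2}}\right) = K^i \quad (i = 1, 2, 3)$$

Multiplying these equations by $\sqrt{1 - v^2/c^2}$, we get 

$$\frac{d}{dt}\left(\frac{mc}{\sqrt{1 - v^2/c^2}}\right) = K^0\sqrt{1 - \frac{v^2}{c^2}} \tag{37.3}$$

$$\frac{d}{dt}\left(\frac{mv_i}{\sqrt{1 - v^2/c^2}}\right) = K^i\sqrt{1 - \frac{v^2}{c^2}} \quad (i = 1, 2, 3) \tag{37.4}$$

If we determine the spatial components $K^i$ of the Minkowski force 
so that they are related to the components of the conventional 
three-dimensional force $F_i$ by the expressions 

$$F_i = K^i\sqrt{1 - \frac{v^2}{c^2}} \quad (i = 1, 2, 3) \tag{37.5}$$

Eqs. (37.4) become 

$$\frac{d}{dt}\left(\frac{mv_i}{\sqrt{1 - v^2/c^2}}\right) = F_i \quad (i = 1, 2, 3) \tag{37.6}$$

It can be seen that when $v \ll c$, Eq. (37.6), as is required, transforms 
into Newton’s equations (37.1). 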

To determine the time component $K^0$ of the Minkowski force, let 
us multiply Eq. (37.2) by the four-velocity $u_\mu$. The result is 

$$K^\mu u_\mu = m\,\frac{du^\mu}{d\tau}\,u_\mu = m w^\mu u_\mu$$

(we have taken into account that $m$ is an invariant and, consequently, 
it can be put outside the derivative sign). Summation of the obtained 
equations over $\mu$ yields 

$$\sum_{\mu=0}^{3} K^\mu u_\mu = m\sum_{\mu=0}^{3} w^\mu u_\mu = 0$$

[see (36.9)]. Substitution for $u^\mu$ of the values (36.6) and for $K^i$ of 
the values obtained from (37.5) yields 

$$K^0\,\frac{c}{\sqrt{1 - v^2/c^2}} - \sum_{i=1}^{3}\frac{F_i}{\sqrt{1 - v^2/c^2}}\,\frac{v_i}{\sqrt{1 - v^2/c^2}} = 0$$

whence 

$$K^0 = \frac{\sum_{i=1}^{3} F_i v_i}{c\sqrt{1 - v^2/c^2}} = \frac{\mathbf{F}\mathbf{v}}{c\sqrt{1 - v^2/c^2}} \tag{37.7}$$


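Now we can write all the components of the Minkowski force. 
With a view to formulas (37.5) and (37.7), we have 

$$K^0 = \frac{\mathbf{F}\mathbf{v}}{c\sqrt{1 - v^2/c^2}}, \quad K^i = \frac{F_i}{\sqrt{1 - v^2/c^2}} \quad (i = 1, 2, 3) \tag{37.8}$$

Hence, 

$$K^\mu = \left(\frac{\mathbf{F}\mathbf{v}}{c\sqrt{1 - v^2/c^2}}, \frac{\mathbf{F}}{\sqrt{1 - v^2/c^2}}\right) \tag{37.9}$$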
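
The components (37.8) were chosen so that the Minkowski force is perpendicular to the four-velocity. This can be checked for arbitrary sample values, as in the sketch below (units in which $c = 1$); the function names `four_velocity` and `minkowski_force` and the sample force and velocity are our own illustrative assumptions.

```python
import math


def four_velocity(v, c=1.0):
    """Components (36.6) of the four-velocity for a three-velocity v."""
    root = math.sqrt(1.0 - sum(x * x for x in v) / c ** 2)
    return (c / root,) + tuple(x / root for x in v)


def minkowski_force(F, v, c=1.0):
    """Components (37.8) built from the three-force F and the three-velocity v."""
    root = math.sqrt(1.0 - sum(x * x for x in v) / c ** 2)
    power = sum(f * x for f, x in zip(F, v))   # scalar product Fv
    return (power / (c * root),) + tuple(f / root for f in F)


v = (0.3, 0.4, 0.0)     # sample velocity, units in which c = 1
F = (2.0, -1.0, 0.5)    # arbitrary sample force
u = four_velocity(v)
K = minkowski_force(F, v)

# Sum over mu of K^mu u_mu for the metric with signature (+, -, -, -).
K_dot_u = K[0] * u[0] - K[1] * u[1] - K[2] * u[2] - K[3] * u[3]
print(K_dot_u)   # should be close to 0
```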


The scalar product of the three-vectors $\mathbf{F}$ and $\mathbf{v}$ gives the work done 
by the force $\mathbf{F}$ on a particle in unit time. This work equals the rate 
of change in the particle’s energy, i.e. $dE/dt$. Consequently, expression 
(37.7) for $K^0$ can be given the form 

$$K^0 = \frac{1}{c\sqrt{1 - v^2/c^2}}\,\frac{dE}{dt} \tag{37.10}$$


where E is the energy of a particle. 

We have thus established that the relativistic-invariant¹ equation 
of the dynamics of a particle has the form of (37.2), where $u^\mu$ is the 
four-velocity with the components (36.6), and $K^\mu$ is the four-force 
(the Minkowski force) with the components (37.8). The spatial components 
of Eq. (37.2) can be represented in the form of (37.4) or 
(37.6). At the limit when $v \ll c$, these equations transform into 
Newton’s equations [see (37.1)]. 

¹ Since $u^\mu$ and $K^\mu$ are four-vectors, the form of Eq. (37.2) in Lorentz 
transformations remains unchanged ($m$ is an invariant by definition). 

The time component of Eq. (37.2) [see (37.3)] after substituting 
for $K^0$ its value from (37.10) becomes 

$$\frac{d}{dt}\left(\frac{mc^2}{\sqrt{1 - v^2/c^2}}\right) = \frac{dE}{dt} \tag{37.11}$$

We thus conclude that the relativistic expression for the energy of 
a particle is 

$$E = \frac{mc^2}{\sqrt{1 - v^2/c^2}} + \text{const} \tag{37.12}$$


38. Momentum and Energy of a Particle 

In classical mechanics, the momentum of a particle is defined as 
a three-vector having the components 

$$p_i = mv_i \quad (i = 1, 2, 3) \tag{38.1}$$

The four-dimensional analogue of this momentum is the four-vector 
having the components 

$$p^\mu = mu^\mu \quad (\mu = 0, 1, 2, 3) \tag{38.2}$$

where $u^\mu$ are the components of the four-velocity¹. Introducing the 
values (36.6) for $u^\mu$, we obtain 

$$p^0 = \frac{mc}{\sqrt{1 - v^2/c^2}}, \quad p^i = \frac{mv_i}{\sqrt{1 - v^2/c^2}} \quad (i = 1, 2, 3) \tag{38.3}$$

which can be written in the form 

$$p^\mu = \left(\frac{mc}{\sqrt{1 - v^2/c^2}}, \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) \tag{38.4}$$

It is easy to see that when $v \ll c$, the formula for the spatial components 
of the relativistic momentum transforms into the Newtonian 
formula (38.1). This gives us grounds to adopt the following 
formula as the relativistic expression for the conventional 
three-dimensional momentum: 

$$\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \tag{38.5}$$


Let us now turn to the time component of the four-momentum. 
At the end of Sec. 37, we obtained formula (37.12) for the energy $E$ 
of a particle without dealing with the value of the integration constant. 
Comparing expression (38.3) for $p^0$ with formula (37.12), it is 
a simple matter to see that by assuming the constant to be zero, we 
can obtain the relation 

$$p^0 = \frac{E}{c} \tag{38.6}$$

¹ If the four-velocity is defined as $u^\mu = dx^\mu/ds$ (see the footnote on p. 135), 
the four-momentum is defined as $p^\mu = mcu^\mu$. 

In this case, the expression for the four-momentum becomes 

$$p^\mu = \left(\frac{E}{c}, \mathbf{p}\right) \tag{38.7}$$

where $\mathbf{p}$ is the quantity determined by formula (38.5). 

The energy and (conventional) momentum are thus the components 
of a single four-vector, the four-momentum of a particle (this four-vector 
is sometimes called the momentum-energy four-vector). This 
circumstance allows us to use formulas (36.1) to find the rules for 
the transformation of $E$ and $\mathbf{p}$ in passing from one inertial reference 
frame to another. Introducing the relevant values of $p^\mu$ into (36.1), 
we easily obtain 

$$E = \frac{E' + v_0 p'_x}{\sqrt{1-\beta^2}}, \quad p_x = \frac{p'_x + v_0 E'/c^2}{\sqrt{1-\beta^2}}, \quad p_y = p'_y, \quad p_z = p'_z \tag{38.8}$$

The inverse transformations differ in the sign of $v_0$. 

Let us find the square of the four-momentum. From Eq. (38.7), 
we get 

$$\sum_{\mu=0}^{3} p^\mu p_\mu = \frac{E^2}{c^2} - p^2$$

[see formula (XII.38)]. At the same time 

$$\sum_{\mu=0}^{3} p^\mu p_\mu = \sum_{\mu=0}^{3} (mu^\mu)(mu_\mu) = m^2\sum_{\mu=0}^{3} u^\mu u_\mu = m^2c^2 \tag{38.9}$$

[see (36.8)]. We thus arrive at the relation 

$$\frac{E^2}{c^2} - p^2 = m^2c^2 \tag{38.10}$$

We must note that the square of the four-momentum, like that of 
any four-vector, is an invariant. 
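
The invariance of the square of the four-momentum can be checked on a numerical example. The sketch below, in units in which $c = 1$, computes $E$ and $p_x$ in the frame K', transforms them to the frame K by formulas (38.8), and compares the quantity $E^2/c^2 - p^2$ in the two frames; the function names `energy_momentum`, `to_frame_K`, and `invariant` and the sample values are our own illustrative assumptions.

```python
import math


def energy_momentum(m, vx, c=1.0):
    """E = c p^0 and p_x from (38.3) for motion along the x-axis."""
    root = math.sqrt(1.0 - (vx / c) ** 2)
    return m * c ** 2 / root, m * vx / root


def to_frame_K(E_prime, px_prime, v0, c=1.0):
    """Formulas (38.8): E and p_x in K from their values in K'."""
    root = math.sqrt(1.0 - (v0 / c) ** 2)
    return (E_prime + v0 * px_prime) / root, (px_prime + v0 * E_prime / c ** 2) / root


def invariant(E, px, c=1.0):
    """Left-hand side of (38.10) for a momentum directed along the x-axis."""
    return (E / c) ** 2 - px ** 2


m, v0 = 2.0, 0.6
E_prime, p_prime = energy_momentum(m, 0.5)    # the particle moves at 0.5c in K'
E, p = to_frame_K(E_prime, p_prime, v0)       # the same particle described in K
print(invariant(E_prime, p_prime), invariant(E, p))   # both should be close to (m c)**2 = 4
```

The values of $E$ and $p_x$ obtained in this way coincide with those computed directly from the velocity of the particle in K, which by (35.20) equals $(0.5 + 0.6)/(1 + 0.3)$ in units of $c$.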

Assuming the constant in formula (37.12) to be zero, we obtain 
the following expression for the energy of a particle: 

$$E = \frac{mc^2}{\sqrt{1 - v^2/c^2}} \tag{38.11}$$

The quantity $E$ determined by expression (38.11) is called the total 
energy of a particle. It must be borne in mind that $E$ does not include 
the potential energy of the particle in an external force field¹. 

¹ It must be remembered that by (37.7) and (37.10) $dE/dt = \mathbf{F}\mathbf{v}$. In Newtonian 
mechanics, on the other hand, the work of the resultant of all the forces 
acting on a particle equals the increment of its kinetic energy $T$, and not of 
the sum $T + U$. 

For a particle at rest (i.e. when $v = 0$), expression (38.11) becomes 

$$E_0 = mc^2 \tag{38.12}$$

where $E_0$ stands for the value of $E$ at $v = 0$. This value is known as 
the rest energy of a particle. We remind our reader that for brevity’s 
sake, we use the term particle to denote a point particle, i.e. a body 
whose dimensions we may disregard. The rest energy of such a body 
consists of the rest energies of the particles which the body consists 
of¹, of the kinetic energies of these particles, and of the energy of 
their interaction with one another. It thus follows that 

$$mc^2 \neq \sum_a m_a c^2$$

where $m$ is the mass of the body, and $m_a$ are the rest masses of the 
particles forming the body. Hence, the mass of a body does not equal 
the sum of the masses of its parts. 

Einstein at one time spent a lot of effort to substantiate the correctness 
of the assumption that the constant in (37.12) equals zero or, 
in other words, to substantiate the statement that the energy $mc^2$ 
is stored in the mass $m$ [see (38.12)]. For this purpose, he considered 
several specific phenomena and showed that in each of them the 
change in a body’s energy by $\Delta E$ leads to a change in its mass by 
$\Delta m = \Delta E/c^2$. Matters are much simpler at present. To substantiate 
the relation $\Delta E = c^2\,\Delta m$, it is sufficient, for example, to consider 
the process of the transformation of an electron and a positron at 
rest into two gamma-quanta. The corresponding measurements 
show that the total energy of these gamma-quanta exactly equals 
the sum of the rest energies of the electron and the positron. 

The difference between the total energy (38.11) and the rest energy 
(38.12) gives the kinetic energy of a particle 

$$T = \frac{mc^2}{\sqrt{1 - v^2/c^2}} - mc^2 \tag{38.13}$$

At small values of $v$, this formula transforms into the Newtonian expression 
for the kinetic energy 

$$T \approx mc^2\left(1 + \frac{v^2}{2c^2}\right) - mc^2 = \frac{1}{2}mv^2 \tag{38.14}$$
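
The transition from (38.13) to the Newtonian expression can be followed numerically. The sketch below compares the two formulas at a small and at a large velocity, in units in which $c = 1$; the function names and the sample velocities are our own illustrative choices. Very small velocities are avoided deliberately, because the subtraction in (38.13) then loses accuracy in floating-point arithmetic.

```python
import math


def kinetic_relativistic(m, v, c=1.0):
    """Kinetic energy according to formula (38.13)."""
    return m * c ** 2 / math.sqrt(1.0 - (v / c) ** 2) - m * c ** 2


def kinetic_newtonian(m, v):
    """Newtonian kinetic energy, the right-hand side of (38.14)."""
    return 0.5 * m * v ** 2


m = 1.0
ratio_slow = kinetic_relativistic(m, 0.01) / kinetic_newtonian(m, 0.01)
ratio_fast = kinetic_relativistic(m, 0.9) / kinetic_newtonian(m, 0.9)
print(ratio_slow)   # close to 1: the Newtonian formula is a good approximation
print(ratio_fast)   # well above 1: the Newtonian formula fails
```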

Let us now consider the momentum of a particle. Expression (38.5) 
which we have obtained can be interpreted to signify that the dependence 
of the momentum of a particle on the velocity is actually more 
complicated than is assumed in Newtonian mechanics. A different 
interpretation is also possible: we may consider that the relativistic 
momentum of a particle equals, as in Newtonian mechanics, the 
product of the mass and the velocity, but the mass is not constant 
and depends on the velocity according to the law 

$$m(v) = \frac{m}{\sqrt{1 - v^2/c^2}} \tag{38.15}$$

¹ We now give a different meaning to the term particle than we did up to now. 

Hence, for the relativistic momentum, we can write the expression 

$$\mathbf{p} = m(v)\,\mathbf{v}$$

that is similar to the Newtonian expression $\mathbf{p} = m\mathbf{v}$. 

Assuming in formula (38.15) that $v$ equals zero, we find that 
$m(0) = m$. Consequently, the invariant quantity $m$ can be considered 
as the value of the mass of a particle at rest. In general courses of 
physics, this quantity is called the rest mass and is usually designated 
by $m_0$. The mass $m(v)$ determined by formula (38.15), on the 
other hand, is called the relativistic mass and is designated by $m$. 
It is customary practice in theoretical physics, however, to deal only 
with the invariant mass. There is therefore no need to use the subscript 
“0” on the symbol $m$ and the word “rest” in the name of the 
mass of a particle. 

The equation of motion (37.6) established in Sec. 37 can be written 
as follows: 

$$\frac{d}{dt}\left(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) = \mathbf{F} \tag{38.16}$$

It is not difficult to see that 

$$\frac{d}{dt}\left(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) = \frac{m}{\sqrt{1 - v^2/c^2}}\,\frac{d\mathbf{v}}{dt} + \frac{m\mathbf{v}\,v}{c^2(1 - v^2/c^2)^{3/2}}\,\frac{dv}{dt} \tag{38.17}$$

Assume that the velocity $\mathbf{v}$ and the force $\mathbf{F}$ acting on a particle are 
collinear. Now the velocity changes only in magnitude, and 
$\mathbf{v}\,(dv/dt) = v\,(d\mathbf{v}/dt)$. After making this substitution in the second 
term of formula (38.17), putting $d\mathbf{v}/dt$ outside the parentheses, and 
performing simple transformations, we get 

$$\frac{m}{(1 - v^2/c^2)^{3/2}}\,\frac{d\mathbf{v}}{dt} = \mathbf{F} \tag{38.18}$$

Now assume that the force $\mathbf{F}$ is perpendicular to the velocity $\mathbf{v}$. 
Hence, the velocity changes only in direction, and $dv/dt = 0$. Consequently, 
formula (38.16) becomes 

$$\frac{m}{\sqrt{1 - v^2/c^2}}\,\frac{d\mathbf{v}}{dt} = \mathbf{F} \tag{38.19}$$

A comparison of expressions (38.18) and (38.19) shows that the 
coefficient of proportionality between the force and the acceleration 
in these two cases is different (in Newtonian mechanics it has a single 
value equal to the mass of a particle). 
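
The two coefficients of proportionality can be recovered numerically from the momentum (38.5) by giving the velocity a small increment first along and then across its direction. The following sketch uses units in which $c = 1$ and a forward finite difference; the function name `momentum`, the step `h`, and the sample values are our own illustrative assumptions.

```python
import math


def momentum(m, v, c=1.0):
    """Three-momentum (38.5) for a velocity vector v = (vx, vy)."""
    root = math.sqrt(1.0 - (v[0] ** 2 + v[1] ** 2) / c ** 2)
    return (m * v[0] / root, m * v[1] / root)


m, v, h = 1.0, 0.8, 1e-6
p0 = momentum(m, (v, 0.0))

# Velocity increment along v: only the magnitude changes, the case of (38.18).
along = (momentum(m, (v + h, 0.0))[0] - p0[0]) / h
# Velocity increment across v: to the first order only the direction changes, the case of (38.19).
across = (momentum(m, (v, h))[1] - p0[1]) / h

print(along, m / (1.0 - v ** 2) ** 1.5)       # the two numbers should nearly coincide
print(across, m / math.sqrt(1.0 - v ** 2))    # and so should these
```

At $v = 0.8c$ the first coefficient is almost three times greater than the second, which illustrates how strongly the two cases differ at high velocities.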
In concluding, we shall note that the relation 

$$E^2 = p^2c^2 + m^2c^4 \tag{38.20}$$

follows from formula (38.10). The energy expressed in terms of the 
momentum is called the Hamiltonian (see Sec. 30). Consequently, 
the relativistic expression for the Hamiltonian of a particle is 

$$H = c\sqrt{p^2 + m^2c^2} \tag{38.21}$$

if the particle is free, and 

$$H = c\sqrt{p^2 + m^2c^2} + U \tag{38.22}$$

if the particle is in an external force field [$U$ is the potential energy 
of the particle in this field; see the paragraph following formula 
(38.11)]. 

39. Action for a Relativistic Particle 

Let us find the expression of the action for a free (i.e. not experiencing 
the action of any forces) particle. The integral expressing the 
action must be invariant relative to the Lorentz transformations. 
Consequently, it must be taken over a scalar, and the latter must 
have the form of a differential to the first power. The only scalar of 
this kind that can be associated with a free particle is a quantity 
proportional to the interval $ds$. Denoting the coefficient of proportionality 
by $\alpha$, we get the following expression for the action: 

$$S = \int_1^2 \alpha\,ds = \int_{t_1}^{t_2} \alpha c\sqrt{1 - \frac{v^2}{c^2}}\,dt \tag{39.1}$$

[we have used formula (34.4) for $ds^2$ and taken into account that 
$dl/dt$ equals the velocity $v$ of the particle]. 

Comparing (39.1) with expression (7.1), we arrive at the conclusion 
that the Lagrangian for a free relativistic particle must be 

$$L = \alpha c\sqrt{1 - \frac{v^2}{c^2}} \tag{39.2}$$

At the limit when $v \ll c$, this function must transform into the 
Newtonian expression 

$$L = \frac{1}{2}mv^2 \tag{39.3}$$

Let us expand function (39.2) in powers of $v/c$. Ignoring the 
terms of the higher orders, we obtain 

$$L = \alpha c\sqrt{1 - \frac{v^2}{c^2}} \approx \alpha c - \frac{\alpha v^2}{2c}$$

We may discard the constant term $\alpha c$ (see Sec. 7). Consequently, 
in the Newtonian approximation, $L = -\alpha v^2/2c$. A comparison with 
(39.3) shows that $\alpha$ must be assumed equal to $-mc$. We have thus 
established the form of the Lagrangian for a free particle¹: 

$$L = -mc^2\sqrt{1 - \frac{v^2}{c^2}} \tag{39.4}$$


Knowing the form of the Lagrangian, we can easily find the momentum 
and energy of a particle. Using formulas (9.5) and (5.1), 
we get 

$$\mathbf{p} = \frac{\partial L}{\partial \mathbf{v}} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} \tag{39.5}$$

$$E = \mathbf{p}\mathbf{v} - L = \frac{mv^2}{\sqrt{1 - v^2/c^2}} + mc^2\sqrt{1 - \frac{v^2}{c^2}} = \frac{mc^2}{\sqrt{1 - v^2/c^2}} \tag{39.6}$$

We have thus arrived at the same formulas for the momentum 
and energy of a particle that were obtained in Sec. 38. It must be 
borne in mind, however, that we obtained formulas (39.5) and (39.6) 
for a free particle, whereas in Sec. 38 similar formulas were obtained 
with the assumption that a particle experiences the action of forces. 
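
The passage from the Lagrangian (39.4) to the momentum and energy can be reproduced numerically for one-dimensional motion. In the sketch below the derivative $\partial L/\partial v$ is estimated by a central difference; the function names, the step `h`, and the sample velocity are our own illustrative assumptions, and units in which $m = 1$ and $c = 1$ are used.

```python
import math


def lagrangian(v, m=1.0, c=1.0):
    """Free-particle Lagrangian (39.4) for motion along one axis."""
    return -m * c ** 2 * math.sqrt(1.0 - (v / c) ** 2)


def momentum_from_lagrangian(v, m=1.0, c=1.0, h=1e-6):
    """p = dL/dv as in (39.5), estimated with a central difference."""
    return (lagrangian(v + h, m, c) - lagrangian(v - h, m, c)) / (2.0 * h)


def energy_from_lagrangian(v, m=1.0, c=1.0):
    """E = p v - L, as in (39.6)."""
    return momentum_from_lagrangian(v, m, c) * v - lagrangian(v, m, c)


v = 0.6
print(momentum_from_lagrangian(v), v / math.sqrt(1.0 - v ** 2))   # compare with (38.5)
print(energy_from_lagrangian(v), 1.0 / math.sqrt(1.0 - v ** 2))   # compare with (38.11)
```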

Let us again revert to expression (39.1). With a view to the value 
of $\alpha$ we have found, we can write the action as 

$$S = -mc\int_1^2 ds \tag{39.7}$$

The true trajectory of a particle is determined by the condition 

$$\delta S = 0 \tag{39.8}$$

For the variation of the action, we have the expression 

$$\delta S = -mc\,\delta\int_1^2 ds = -mc\int_1^2 \delta(ds)$$

The interval is 

$$ds = \sqrt{\sum_\mu dx^\mu\,dx_\mu}$$

According to (XII.40), the variation of the radicand can be written as 

$$\delta\sum_\mu dx^\mu\,dx_\mu = 2\sum_\mu dx_\mu\,\delta\,dx^\mu$$

¹ For a particle in an external potential force field, we have 

$$L = -mc^2\sqrt{1 - \frac{v^2}{c^2}} - U$$

where $U$ is the potential energy of the particle. We must note that in relativistic 
mechanics, $L$ does not equal $T - U$. 

Consequently, 

$$\delta S = -mc\int_1^2 \delta\sqrt{\sum_\mu dx^\mu\,dx_\mu} = -mc\int_1^2 \frac{2\sum_\mu dx_\mu\,\delta\,dx^\mu}{2\sqrt{\sum_\mu dx^\mu\,dx_\mu}} = -mc\int_1^2 \frac{\sum_\mu dx_\mu\,\delta\,dx^\mu}{ds} \tag{39.9}$$

The derivative $dx_\mu/ds$ is $u_\mu/c$ [see (36.3)]. In addition, $\delta(dx^\mu) = 
d(\delta x^\mu)$ [see (III.4)]. We therefore arrive at the expression 

$$\delta S = -m\int_1^2 \sum_\mu u_\mu\,d(\delta x^\mu)$$

Let us integrate this expression by parts: 

$$\delta S = -m\sum_\mu u_\mu\,\delta x^\mu\,\Big|_1^2 + m\int_1^2 \sum_\mu \delta x^\mu\,\frac{du_\mu}{d\tau}\,d\tau \tag{39.10}$$

[we have represented $du_\mu$ in the form $(du_\mu/d\tau)\,d\tau$, where $\tau$ is the 
proper time of a particle]. At the ends of the trajectory, $\delta x^\mu = 0$. 
Therefore, condition (39.8) becomes 

$$\delta S = m\int_1^2 \sum_\mu \delta x^\mu\,\frac{du_\mu}{d\tau}\,d\tau = 0 \tag{39.11}$$

For this condition to be observed with arbitrary values of $\delta x^\mu$, 
it is essential that the quantities $du_\mu/d\tau$ vanish, i.e. the four-velocity 
of a particle be constant, which obviously holds for a free particle. 

Let us find the action as a function of the coordinates of a particle 
(i.e. as a function of the upper limit of integration; see Sec. 32). 
For real motion, $du_\mu/d\tau = 0$, hence the second term in (39.10) 
vanishes. We consider that the lower limit of integration is fixed, 
therefore $(\delta x^\mu)_1 = 0$. Consequently, the action as a function of the 
coordinates of a particle satisfies the relation 

$$\delta S = -m\sum_{\mu=0}^{3} u_\mu\,\delta x^\mu$$

(we have omitted the subscript “2” on $\delta x^\mu$). The quantities $mu_\mu$ 
give covariant components of the four-momentum of a particle 
[see (38.2)] so that the increment of the action can be written as 

$$\delta S = -\sum_{\mu=0}^{3} p_\mu\,\delta x^\mu \tag{39.12}$$

In Sec. 32, we obtained the following expression for the increment 
of the action due to a change in the final position of a particle in 
conventional (three-dimensional) space: 

$$\delta S = \sum_{i=1}^{3} p_i\,\delta x_i \tag{39.13}$$

[see formula (32.5)]. It is a simple matter to note that the terms of 
formula (39.12) corresponding to the spatial coordinates yield the 
sum (39.13) after the index on $\delta x^\mu$ has been lowered. 

Examination of (39.12) shows that the covariant components of 
the four-momentum can be determined as follows: 

$$p_\mu = -\frac{\partial S}{\partial x^\mu} \tag{39.14}$$

[compare with (32.6)]. 

It was established in Sec. 38 that $\sum_\mu p^\mu p_\mu = m^2c^2$. Lowering the index of the first multiplier, i.e. replacing the contravariant multipliers with the corresponding covariant ones, we find that

$$p_0^2 - p_1^2 - p_2^2 - p_3^2 = m^2c^2$$

Substituting for $p_\mu$ in this equation their values from (39.14), we arrive at the Hamilton-Jacobi relativistic equation

$$\frac{1}{c^2}\left(\frac{\partial S}{\partial t}\right)^2 - \left(\frac{\partial S}{\partial x}\right)^2 - \left(\frac{\partial S}{\partial y}\right)^2 - \left(\frac{\partial S}{\partial z}\right)^2 = m^2c^2 \tag{39.15}$$

(we have introduced $ct$ instead of $x^0$, $x$ instead of $x^1$, etc.).

The action S in (39.15) differs from the non-relativistic action S'. This can readily be understood if we have in mind that the action is related to the energy by the expression $E = -\partial S/\partial t$ [see formula (32.10)]. The non-relativistic energy E', on the other hand, differs from the relativistic energy E in the term $mc^2$ (i.e. $E = E' + mc^2$). Hence

$$\frac{\partial S}{\partial t} = \frac{\partial S'}{\partial t} - mc^2$$

or

$$S = S' - mc^2 t$$


Using this relation in (39.15), we get an equation for S':

$$\frac{1}{2mc^2}\left(\frac{\partial S'}{\partial t}\right)^2 - \frac{\partial S'}{\partial t} - \frac{1}{2m}\left[\left(\frac{\partial S'}{\partial x}\right)^2 + \left(\frac{\partial S'}{\partial y}\right)^2 + \left(\frac{\partial S'}{\partial z}\right)^2\right] = 0$$

which in the limit $c \to \infty$ transforms into the classical Hamilton-Jacobi equation for a free particle [see formula (32.15) in which we must assume that U = 0].
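The two equations above can be checked numerically for the simplest case. The sketch below is an illustration only: it assumes the free-particle action $S = -Et + \mathbf{p}\mathbf{r}$ with $E = \sqrt{p^2c^2 + m^2c^4}$, so that $\partial S/\partial t = -E$ and the spatial derivatives equal the momentum components. The function name `hj_residuals` and the sample values are hypothetical.

```python
import math

def hj_residuals(m, c, p):
    """Residuals of Eq. (39.15) and of the equation for S' for the
    assumed free-particle action S = -E*t + p.r (illustrative check)."""
    px, py, pz = p
    p2 = px * px + py * py + pz * pz
    energy = math.sqrt(p2 * c * c + m * m * c ** 4)

    dS_dt = -energy  # E = -dS/dt, the relation quoted from (32.10)
    # Left-hand side of (39.15) minus its right-hand side.
    relativistic = dS_dt ** 2 / c ** 2 - p2 - m * m * c * c

    # S = S' - m c^2 t, hence dS'/dt = dS/dt + m c^2.
    dSp_dt = dS_dt + m * c * c
    # Left-hand side of the equation for S'.
    primed = dSp_dt ** 2 / (2 * m * c * c) - dSp_dt - p2 / (2 * m)
    return relativistic, primed

rel, primed = hj_residuals(m=1.0, c=1.0, p=(0.3, 0.4, 0.0))
print(rel, primed)  # both residuals are zero to rounding accuracy
```

Both residuals vanish to rounding accuracy, which is the content of (39.15) and of the equation for S' for a free particle.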






40. Energy-Momentum Tensor 

In this section, we shall carry out a very important generalization of the expression for the action. In such a generalized form, the expression for the action can be applied not only to purely mechanical systems, but also to an electromagnetic field and other physical systems.

Up to now, we have written the expression for the action

$$S = \int_1^2 L(q_k, \dot q_k, t)\, dt \tag{40.1}$$

where L is the Lagrangian, $q_k = q_k(t)$ are the generalized coordinates determining the position of the particles of the system, and $\dot q_k$ are the generalized velocities equal to $dq_k/dt$. The quantities $q_k$ and $\dot q_k$ are assumed to depend only on the time.

When we write equations in the four-dimensional form, we have to do with the four formally equivalent variables $x^0, x^1, x^2, x^3$ that must be present in equations in a similar way. To reflect this circumstance, we shall write the expression for the action as

$$S = \frac{1}{c}\int L^*(q_a, q_{a\nu}, x^0, x^1, x^2, x^3)\, dx^0\, dx^1\, dx^2\, dx^3 \tag{40.2}$$

where by $q_a$ we understand the set of the quantities $q_1, q_2, \ldots$ determining the state of a system (the parameters of the system). There may be any number of these parameters, particularly, an infinitely large one. By $q_{a\nu}$ is understood the set of the partial derivatives of the parameters $q_a$ with respect to the coordinates $x^\nu$:

$$q_{a\nu} = \frac{\partial q_a}{\partial x^\nu} \quad (a = 1, 2, \ldots;\ \nu = 0, 1, 2, 3) \tag{40.3}$$

The factor 1/c has been introduced for convenience.

The quantities $q_a$ and $q_{a\nu}$ are considered as functions of the coordinates $x^0, x^1, x^2, x^3$. Particularly, the parameters $q_a$ may be found to depend only on $x^0$. Here, we arrive at the case we already know when $q_a = q_a(t)$.

We must note that since $\partial^2 q_a/\partial x^\nu \partial x^\mu = \partial^2 q_a/\partial x^\mu \partial x^\nu$, the following relation holds:

$$\frac{\partial q_{a\nu}}{\partial x^\mu} = \frac{\partial q_{a\mu}}{\partial x^\nu} \tag{40.4}$$

To establish the conformity between expressions (40.2) and (40.1), let us take into account that an element of the four-volume $dV^*$ is related to an element of the volume $dV$ in conventional space and the time interval $dt$ by the following expression:

$$dV^* = dx^0\, dx^1\, dx^2\, dx^3 = c\, dt\, dV \tag{40.5}$$

Introducing this value of $dV^*$ into (40.2), we get the expression

$$S = \int L^*\, dt\, dV = \int_1^2 dt \int L^*\, dV \tag{40.6}$$

Integration with respect to $dt$ is performed within the given interval of time, and with respect to $dV$ over the entire three-dimensional volume.

A comparison of expressions (40.1) and (40.6) shows that

$$L = \int L^*\, dV \tag{40.7}$$


Thus, the function $L^*$ is the "density" of the Lagrangian of the system being considered.

For a closed mechanical system, the Lagrangian does not depend explicitly on t (see Sec. 8). Similarly, the absence of an explicit dependence of $L^*$ on the coordinates $x^0, x^1, x^2, x^3$ should be a mathematical expression of the fact that the system is closed. Hence, for a closed system, the action has the form

$$S = \frac{1}{c}\int L^*(q_a, q_{a\nu})\, dV^* \tag{40.8}$$


Let us find the equations of motion for a closed system. For this 
purpose, we shall calculate the variation of the action (40.8) and 
equate it to zero. 

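The variation of expression (40.8) is

$$\delta S = \frac{1}{c}\int \left[\sum_a \frac{\partial L^*}{\partial q_a}\, \delta q_a + \sum_{a,\nu} \frac{\partial L^*}{\partial q_{a\nu}}\, \delta q_{a\nu}\right] dV^*$$

By analogy with the relation $\delta y' = (\delta y)'$, we have $\delta q_{a\nu} = \partial(\delta q_a)/\partial x^\nu$. After performing this replacement, we obtain

$$\delta S = \frac{1}{c}\int \left[\sum_a \frac{\partial L^*}{\partial q_a}\, \delta q_a + \sum_{a,\nu} \frac{\partial L^*}{\partial q_{a\nu}}\, \frac{\partial (\delta q_a)}{\partial x^\nu}\right] dV^* \tag{40.9}$$

According to the rules for the differentiation of a product, we have

$$\sum_{a,\nu} \frac{\partial}{\partial x^\nu}\left(\frac{\partial L^*}{\partial q_{a\nu}}\, \delta q_a\right) = \sum_{a,\nu} \frac{\partial L^*}{\partial q_{a\nu}}\, \frac{\partial (\delta q_a)}{\partial x^\nu} + \sum_{a,\nu} \delta q_a\, \frac{\partial}{\partial x^\nu}\, \frac{\partial L^*}{\partial q_{a\nu}}$$

The first sum on the right-hand side is identical to the second term of the integrand in formula (40.9). We can therefore write

$$\delta S = \frac{1}{c}\int \left[\sum_a \frac{\partial L^*}{\partial q_a}\, \delta q_a + \sum_{a,\nu} \frac{\partial}{\partial x^\nu}\left(\frac{\partial L^*}{\partial q_{a\nu}}\, \delta q_a\right) - \sum_{a,\nu} \delta q_a\, \frac{\partial}{\partial x^\nu}\, \frac{\partial L^*}{\partial q_{a\nu}}\right] dV^* \tag{40.10}$$

The second of the sums

$$\sum_{a,\nu} \frac{\partial}{\partial x^\nu}\left(\frac{\partial L^*}{\partial q_{a\nu}}\, \delta q_a\right) = \sum_\nu \frac{\partial}{\partial x^\nu}\left(\sum_a \frac{\partial L^*}{\partial q_{a\nu}}\, \delta q_a\right)$$

is a four-divergence of a vector whose $\nu$-th component is

$$\sum_a \frac{\partial L^*}{\partial q_{a\nu}}\, \delta q_a$$

Therefore, using the four-dimensional analogue of the Ostrogradsky-Gauss theorem [see (XII.72)], the second of the three terms in formula (40.10) can be replaced with an integral over the closed hypersurface confining the four-volume over which integration is being performed in (40.10):

$$\frac{1}{c}\oint \sum_\nu \left(\sum_a \frac{\partial L^*}{\partial q_{a\nu}}\, \delta q_a\right) df_\nu \tag{40.11}$$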


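On the boundary of the four-volume being considered, however, the variations $\delta q_a = 0$. (Similarly in mechanics, the variations $\delta q_k$ at boundary points are zero.) Consequently, integral (40.11) vanishes, so that only the first and third sums remain in formula (40.10). Let us combine them, factoring out the common factor $\delta q_a$:

$$\delta S = \frac{1}{c}\int \sum_a \delta q_a \left[\frac{\partial L^*}{\partial q_a} - \sum_\nu \frac{\partial}{\partial x^\nu}\, \frac{\partial L^*}{\partial q_{a\nu}}\right] dV^* = 0$$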


Owing to the arbitrary nature of the variations $\delta q_a$, the expression we have obtained can equal zero only if all the expressions in brackets vanish. We thus arrive at the following equations of motion:

$$\frac{\partial L^*}{\partial q_a} = \sum_\nu \frac{\partial}{\partial x^\nu}\, \frac{\partial L^*}{\partial q_{a\nu}} \quad (a = 1, 2, \ldots) \tag{40.12}$$

These equations are a generalization of Lagrange's equations

$$\frac{\partial L}{\partial q_k} = \frac{d}{dt}\, \frac{\partial L}{\partial \dot q_k} \tag{40.13}$$

[see formula (7.3)]. The right-hand side of (40.12) is the sum of four derivatives because the role played by the single variable t is now played by four variables $x^\nu$. It is not difficult to see that if the $q_a$'s depend only on $x^0$, i.e. on t, Eq. (40.12) transforms into Eq. (40.13).

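Let us multiply Eq. (40.12) by $q_{a\mu}$ and summate over a:

$$\sum_a \frac{\partial L^*}{\partial q_a}\, q_{a\mu} = \sum_{a,\nu} q_{a\mu}\, \frac{\partial}{\partial x^\nu}\, \frac{\partial L^*}{\partial q_{a\nu}}$$

The right-hand side can be transformed by the formula

$$u\, \frac{\partial v}{\partial x^\nu} = \frac{\partial (uv)}{\partial x^\nu} - v\, \frac{\partial u}{\partial x^\nu}$$

The result is

$$\sum_a \frac{\partial L^*}{\partial q_a}\, q_{a\mu} = \sum_{a,\nu} \frac{\partial}{\partial x^\nu}\left(q_{a\mu}\, \frac{\partial L^*}{\partial q_{a\nu}}\right) - \sum_{a,\nu} \frac{\partial L^*}{\partial q_{a\nu}}\, \frac{\partial q_{a\mu}}{\partial x^\nu} \tag{40.14}$$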


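Let us perform the substitutions $q_{a\mu} = \partial q_a/\partial x^\mu$ in the sum on the left-hand side and $\partial q_{a\mu}/\partial x^\nu = \partial q_{a\nu}/\partial x^\mu$ [see (40.4)] in the second sum on the right-hand side, and also group the terms of expression (40.14) in a different way. The result is the relation

$$\sum_a \frac{\partial L^*}{\partial q_a}\, \frac{\partial q_a}{\partial x^\mu} + \sum_{a,\nu} \frac{\partial L^*}{\partial q_{a\nu}}\, \frac{\partial q_{a\nu}}{\partial x^\mu} = \sum_{a,\nu} \frac{\partial}{\partial x^\nu}\left(q_{a\mu}\, \frac{\partial L^*}{\partial q_{a\nu}}\right)$$

The left-hand side of the expression obtained is $\partial L^*/\partial x^\mu$. Consequently, we have arrived at the formula

$$\frac{\partial L^*}{\partial x^\mu} = \sum_{a,\nu} \frac{\partial}{\partial x^\nu}\left(q_{a\mu}\, \frac{\partial L^*}{\partial q_{a\nu}}\right)$$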

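The left-hand side of this formula can be written as¹

$$\frac{\partial L^*}{\partial x^\mu} = \sum_\nu \delta_\mu^\nu\, \frac{\partial L^*}{\partial x^\nu}$$

which yields the relation

$$\sum_\nu \delta_\mu^\nu\, \frac{\partial L^*}{\partial x^\nu} = \sum_{a,\nu} \frac{\partial}{\partial x^\nu}\left(q_{a\mu}\, \frac{\partial L^*}{\partial q_{a\nu}}\right)$$

The latter can be transformed as follows:

$$\sum_\nu \frac{\partial}{\partial x^\nu}\left[\sum_a q_{a\mu}\, \frac{\partial L^*}{\partial q_{a\nu}} - \delta_\mu^\nu L^*\right] = 0 \tag{40.15}$$

Equation (40.15) is a collection of four equations corresponding to different values of the index $\mu$ ($\mu$ = 0, 1, 2, 3). The expressions in brackets have the properties of mixed components of a four-tensor of rank two. Denoting this tensor by the symbol $T_\mu^\nu$, we obtain:

$$T_\mu^\nu = \sum_a q_{a\mu}\, \frac{\partial L^*}{\partial q_{a\nu}} - \delta_\mu^\nu L^* \tag{40.16}$$

¹ It must be remembered that $\partial\varphi/\partial x^\nu$ is a covariant component of a four-vector.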

Using this symbol, we can write Eq. (40.15) in the form

$$\sum_\nu \frac{\partial T_\mu^\nu}{\partial x^\nu} = 0 \quad (\mu = 0, 1, 2, 3) \tag{40.17}$$

We remind our reader that we have obtained Eq. (40.15) by equating the variation of the action (40.8) to zero.

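The tensor satisfying Eq. (40.17) is determined non-uniquely. Any tensor of the form

$$T^{\prime\,\nu}_{\mu} = T_\mu^\nu + \sum_\rho \frac{\partial Q_\mu^{\nu\rho}}{\partial x^\rho} \tag{40.18}$$

where $Q_\mu^{\nu\rho}$ is a tensor antisymmetrical with respect to the indices $\nu$ and $\rho$ ($Q_\mu^{\nu\rho} = -Q_\mu^{\rho\nu}$) also satisfies Eq. (40.17). Indeed, owing to the antisymmetry of the tensor $Q_\mu^{\nu\rho}$ we have

$$\frac{\partial^2 Q_\mu^{\nu\rho}}{\partial x^\nu\, \partial x^\rho} = -\frac{\partial^2 Q_\mu^{\rho\nu}}{\partial x^\rho\, \partial x^\nu}$$

and, consequently,

$$\sum_\nu \frac{\partial}{\partial x^\nu} \sum_\rho \frac{\partial Q_\mu^{\nu\rho}}{\partial x^\rho} = \sum_{\nu,\rho} \frac{\partial^2 Q_\mu^{\nu\rho}}{\partial x^\nu\, \partial x^\rho} = 0$$

The following condition will therefore follow from (40.17):

$$\sum_\nu \frac{\partial T^{\prime\,\nu}_{\mu}}{\partial x^\nu} = 0 \tag{40.19}$$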

By properly choosing the tensor $Q_\mu^{\nu\rho}$, we can always make the tensor (40.18) symmetric. We shall assume in the following that this condition is observed and

$$T_{\mu\nu} = T_{\nu\mu} \tag{40.20}$$

We shall note that since $Q_\mu^{\mu\mu} = 0$,

$$T^{\prime\,\mu}_{\mu} = T_\mu^\mu \tag{40.21}$$

i.e., $T^{\prime\,\mu}_{\mu}$ is determined by formula (40.16).

The transition of the index $\mu$ upward in (40.19) will either leave all the $T_\mu^\nu$'s unchanged (if $\mu$ = 0), or will reverse the sign of all the $T_\mu^\nu$'s (if $\mu$ = 1, 2, 3). Consequently, it follows from (40.19) that

$$\sum_\nu \frac{\partial T^{\mu\nu}}{\partial x^\nu} = 0 \quad (\mu = 0, 1, 2, 3) \tag{40.22}$$

The tensor $T^{\mu\nu}$ is known as the energy-momentum tensor of a system. The grounds for this name will come to light below.

It is shown in Appendix XII that when the tensor $T^{\mu\nu}$ satisfies the condition (40.22) and all the $T^{\mu\nu}$'s vanish at infinity, the vector having the components

$$p^\mu = a \int \sum_\nu T^{\mu\nu}\, df_\nu \tag{40.23}$$

[see formula (XII.86)], remains constant in time (is conserved). In Eq. (40.23), a is an arbitrary constant, and $df_\nu$ is a component of the four-vector of an element of the hypersurface. The integral in Eq. (40.23) is taken over an arbitrary hypersurface including the entire three-dimensional space. If we take the hyperplane $x^0$ = const as the hypersurface over which integration is being performed, all the $df_\nu$'s except $df_0 = dV$ vanish, and expression (40.23) will be simplified as follows:

$$p^\mu = a \int T^{\mu 0}\, df_0 = a \int T^{\mu 0}\, dV \tag{40.24}$$

We know that for a closed system the total (i.e. taken over the entire volume) energy and the total momentum are conserved. The energy and the momentum are the components of the four-momentum. Consequently, the four-momentum of a closed system must also be conserved. This gives us the grounds to identify the vector determined by formulas (40.23) and (40.24) with the four-momentum of a system. The constant a must be chosen so that formulas (40.23) and (40.24) agree with the previous definition of the four-momentum [see (38.7)], according to which, for example, $p^0 = E/c$. Assuming in formula (40.24) that $\mu$ = 0, we get

$$p^0 = a \int T^{00}\, dV = \frac{E}{c} \tag{40.25}$$

Using formulas (40.21) and (40.16), we can write

$$T^{00} = T_0^0 = \sum_a q_{a0}\, \frac{\partial L^*}{\partial q_{a0}} - L^* = \sum_a \dot q_a\, \frac{\partial L^*}{\partial \dot q_a} - L^*$$

where $\dot q_a = \partial q_a/\partial t$ [we have taken into consideration that $q_{a0} = \partial q_a/\partial x^0 = \partial q_a/\partial (ct) = (1/c)(\partial q_a/\partial t) = (1/c)\, \dot q_a$].

The integral $\int L^*\, dV$ gives the Lagrangian L. We therefore arrive at the relation

$$\int T^{00}\, dV = \sum_a \dot q_a\, \frac{\partial L}{\partial \dot q_a} - L$$

According to formula (5.1), the expression on the right-hand side determines the energy E of a system. Hence

$$\int T^{00}\, dV = E \tag{40.26}$$

and $T^{00}$ is the energy density w:

$$T^{00} = w \tag{40.27}$$

Substitution of the value (40.26) into formula (40.25) leads us to the conclusion that a = 1/c. Consequently,

$$p^\mu = \frac{1}{c} \int \sum_\nu T^{\mu\nu}\, df_\nu \tag{40.28}$$

if integration is performed over an arbitrary hypersurface including all the three-dimensional space, and

$$p^\mu = \frac{1}{c} \int T^{\mu 0}\, df_0 = \frac{1}{c} \int T^{\mu 0}\, dV \tag{40.29}$$

if integration is performed over the hyperplane $x^0$ = const.

To reveal the meaning of the components $T^{0\nu}$, let us write expression (40.22) for $\mu$ = 0:

$$\sum_{\nu=0}^{3} \frac{\partial T^{0\nu}}{\partial x^\nu} = \frac{\partial T^{00}}{\partial x^0} + \sum_{k=1}^{3} \frac{\partial T^{0k}}{\partial x^k} = 0$$

or

$$\frac{1}{c}\, \frac{\partial T^{00}}{\partial t} = -\sum_{k=1}^{3} \frac{\partial T^{0k}}{\partial x^k}$$

Multiplication of the relation obtained by c and integration over a certain volume V yield

$$-\frac{\partial}{\partial t} \int_V T^{00}\, dV = \int_V \sum_{k=1}^{3} c\, \frac{\partial T^{0k}}{\partial x^k}\, dV \tag{40.30}$$

The integral on the left-hand side equals the energy confined in the volume V. The integrand on the right-hand side is the conventional (three-dimensional) divergence of a certain vector S having the components

$$S_x = cT^{01}, \quad S_y = cT^{02}, \quad S_z = cT^{03} \tag{40.31}$$

Formula (40.30) can thus be written as

$$-\frac{\partial E}{\partial t} = \int_V \nabla \mathbf{S}\, dV$$

Applying the Ostrogradsky-Gauss theorem to the right-hand side, we obtain

$$-\frac{\partial E}{\partial t} = \oint_f \mathbf{S}\, d\mathbf{f}$$

where the integral is taken over the closed surface f confining the volume V.

The decrement of the energy in the volume V per unit time must equal the energy flux through the surface. Consequently, S is the vector of the energy flux density. By (40.31),

$$T^{01} = \frac{1}{c}\, S_x, \quad T^{02} = \frac{1}{c}\, S_y, \quad T^{03} = \frac{1}{c}\, S_z \tag{40.32}$$

Owing to symmetry of the tensor $T^{\mu\nu}$, the relation $T^{k0} = T^{0k}$ is observed, so that

$$T^{10} = \frac{1}{c}\, S_x, \quad T^{20} = \frac{1}{c}\, S_y, \quad T^{30} = \frac{1}{c}\, S_z \tag{40.33}$$

Hence, the components $T^{0k}$ and $T^{k0}$ to within the factor 1/c equal the relevant components of the vector of the energy flux density. The spatial components of the vector (40.28) are

$$p^k = \frac{1}{c} \int T^{k0}\, dV$$

We thus conclude that the vector g with the components $T^{k0}/c$ determines the density of the momentum of a system:

$$g_x = \frac{1}{c}\, T^{10}, \quad g_y = \frac{1}{c}\, T^{20}, \quad g_z = \frac{1}{c}\, T^{30} \tag{40.34}$$

A comparison of (40.34) and (40.33) shows that

$$g_k = \frac{1}{c^2}\, S_k \tag{40.35}$$

or in the vector form

$$\mathbf{g} = \frac{1}{c^2}\, \mathbf{S} \tag{40.36}$$

We have thus arrived at the conclusion that there is a relation between the energy flux and the momentum: the density of the momentum equals the density of the energy flux divided by $c^2$.

To establish the meaning of the components $T^{ik}$, let us write relation (40.22) for $\mu = i$:

$$\sum_{\nu=0}^{3} \frac{\partial T^{i\nu}}{\partial x^\nu} = \frac{\partial T^{i0}}{\partial x^0} + \sum_{k=1}^{3} \frac{\partial T^{ik}}{\partial x^k} = 0$$

Hence, having in view that $x^0 = ct$ and $T^{i0} = cg_i$, we find that

$$-\frac{\partial g_i}{\partial t} = \sum_{k=1}^{3} \frac{\partial T^{ik}}{\partial x^k}$$

Let us integrate the expression obtained over a certain volume V. Taking into account that $\int \mathbf{g}\, dV = \mathbf{p}$ (here p is the momentum of the part of the system confined in the volume V), we obtain

$$-\frac{\partial p_i}{\partial t} = \int_V \sum_{k=1}^{3} \frac{\partial T^{ik}}{\partial x^k}\, dV$$

We transform the right-hand side according to the Ostrogradsky-Gauss theorem:

$$-\frac{\partial p_i}{\partial t} = \oint_f \sum_{k=1}^{3} \sigma_{ik}\, df_k \tag{40.37}$$

where $\boldsymbol{\sigma}_i$ is a vector with the components

$$\sigma_{ik} = T^{ik} \tag{40.38}$$

The left-hand side of formula (40.37) contains the rate of diminishing of the i-th component of the momentum confined in the volume V. Consequently, the right-hand side has the meaning of the flux of the component $p_i$ through the surface f confining V, and $\boldsymbol{\sigma}_i$ is the density of the flux of $p_i$. The quantity (40.38) is the k-th component of the flux density of the component $p_i$. Hence, the three-dimensional tensor (40.38) determines the density of the momentum flux. We must note that the flux density of a scalar quantity (for instance, the energy) is a vector; the flux density of a vector quantity (for instance, the momentum) is a tensor.

The momentum carried through unit area in unit time equals the force acting on this area, i.e. the stress at the location of the area. This is why the tensor $\sigma_{ik}$ is called the stress tensor.

We have thus established the meaning of all the components of the tensor $T^{\mu\nu}$. Combining (40.27), (40.32), (40.33), and (40.38), we obtain

$$(T^{\mu\nu}) = \begin{pmatrix} w & S_x/c & S_y/c & S_z/c \\ S_x/c & \sigma_{xx} & \sigma_{xy} & \sigma_{xz} \\ S_y/c & \sigma_{yx} & \sigma_{yy} & \sigma_{yz} \\ S_z/c & \sigma_{zx} & \sigma_{zy} & \sigma_{zz} \end{pmatrix} \tag{40.39}$$

The components of the tensor $T^{\mu\nu}$ characterize the density of the energy and the momentum, and also the densities of the fluxes of these quantities, which explains the name of the energy-momentum tensor.

ELECTRODYNAMICS


Chapter VIII
ELECTROSTATICS


41. Electrostatic Field in a Vacuum

In general, an electric and a magnetic field are closely associated 
with each other, forming a single electromagnetic field. Stationary 
(i.e. not varying with time) electric and magnetic fields, however, 
may be treated separately. We shall begin with considering an elec- 
trostatic field in a vacuum. 

The basic (force) characteristic of an electric field is the field 
strength E related to the force acting on a point charge e at a given 
point in the field by the expression 

F = eE (41.1) 

This relation may be considered as the definition of the quantity E. 

An electrostatic field is a potential one. This signifies that the work done by the forces of this field on a charge over any closed path is zero:

$$\oint \mathbf{F}\, d\mathbf{l} = e \oint \mathbf{E}\, d\mathbf{l} = 0 \tag{41.2}$$

A glance at formula (41.2) shows that the circulation of the electrostatic field strength vector over any contour (closed path) $\Gamma$ is zero. Using Stokes's theorem [see (XI.23)], we can write

$$\oint_\Gamma \mathbf{E}\, d\mathbf{l} = \int_f [\nabla \mathbf{E}]\, d\mathbf{f} = 0 \tag{41.3}$$

where f is an arbitrary surface confined by the contour $\Gamma$, and $d\mathbf{f}$ is the vector of an elementary area taken on this surface.

Condition (41.3) must be satisfied for any arbitrarily chosen surface f. This is possible only if the integrand function at each point is zero. We thus arrive at the conclusion that the curl of an electrostatic field strength vector equals zero at every point of the field:

$$[\nabla \mathbf{E}] = 0 \tag{41.4}$$

The equality to zero of the strength curl is a feature of an electrostatic field expressing its potential nature.

It is known from vector analysis that the curl of the gradient of a scalar function always equals zero [see (XI.43)]. Therefore, the strength of an electrostatic field can be represented as the gradient of a scalar function $\varphi$:

$$\mathbf{E} = -\nabla\varphi \tag{41.5}$$

(the minus sign does not change matters, it has been taken from physical considerations). The function $\varphi$ is known as the potential of an electrostatic field. It is obvious that $\varphi$ has been determined to within an arbitrary constant addend. The potential can therefore be measured from any point of a field for which its value is taken equal to zero. In electrodynamics, the potential is usually assumed to be zero at infinity.

The potential of the field of a point charge e equals, as is known,

$$\varphi(r) = \frac{e}{r} \tag{41.6}$$

where r is a position vector drawn from the point where e is to the point for which $\varphi$ is being determined.

Taking the gradient of expression (41.6) and reversing its sign, we find the field strength of a point charge:

$$\mathbf{E} = -\nabla\left(\frac{e}{r}\right) = \frac{e}{r^2}\, \mathbf{e}_r \tag{41.7}$$

where $\mathbf{e}_r$ is the unit vector of the position vector r [see (XI.51)].

Assume that a field is set up by a system of point charges $e_a$ placed at points determined by the position vectors $\mathbf{r}'_a$ (Fig. 41.1). Hence, according to the superposition principle, the potential of the field set up by this system at the point determined by the position vector r is

$$\varphi(\mathbf{r}) = \sum_a \frac{e_a}{|\mathbf{r} - \mathbf{r}'_a|} \tag{41.8}$$

and the field strength is

$$\mathbf{E}(\mathbf{r}) = \sum_a \frac{e_a\, (\mathbf{r} - \mathbf{r}'_a)}{|\mathbf{r} - \mathbf{r}'_a|^3} \tag{41.9}$$

If the charge setting up a field is distributed in space with the density $\rho = \rho(\mathbf{r}')$, the potential and the strength of the field can be calculated by formulas similar to (41.8) and (41.9):

$$\varphi(\mathbf{r}) = \int \frac{\rho(\mathbf{r}')\, dV'}{|\mathbf{r} - \mathbf{r}'|} \tag{41.10}$$

$$\mathbf{E}(\mathbf{r}) = \int \frac{\rho(\mathbf{r}')\, (\mathbf{r} - \mathbf{r}')\, dV'}{|\mathbf{r} - \mathbf{r}'|^3} \tag{41.11}$$

where $dV' = dx'\, dy'\, dz'$ is a volume element at the point r' (here x', y', z' are components of the varying vector r').

The transition from the system of point charges $e_a$ to a charge distributed in space with the density $\rho(\mathbf{r})$ is accomplished with the aid of the Dirac delta function (see Appendix XIII). This function allows us to represent the system of point charges $e_a$ at points with the position vectors $\mathbf{r}'_a$ by means of the charge density:

$$\rho(\mathbf{r}) = \sum_a e_a\, \delta(\mathbf{r} - \mathbf{r}'_a) \tag{41.12}$$

It is obvious that by introducing the function (41.12) into expressions (41.10) and (41.11) and performing integration, we shall arrive at formulas (41.8) and (41.9).
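Formulas (41.8) and (41.9) lend themselves to a direct numerical check of relation (41.5). The sketch below is illustrative only: it works in Gaussian units, the function names and the sample charge configuration are hypothetical, and the gradient is approximated by central differences.

```python
import math

def potential(r, charges):
    """phi(r) = sum_a e_a / |r - r_a|, Eq. (41.8), Gaussian units."""
    return sum(e / math.dist(r, ra) for e, ra in charges)

def field(r, charges):
    """E(r) = sum_a e_a (r - r_a) / |r - r_a|^3, Eq. (41.9)."""
    strength = [0.0, 0.0, 0.0]
    for e, ra in charges:
        dist3 = math.dist(r, ra) ** 3
        for i in range(3):
            strength[i] += e * (r[i] - ra[i]) / dist3
    return strength

def minus_gradient(r, charges, h=1e-5):
    """Central-difference estimate of -grad(phi), to compare with (41.5)."""
    result = []
    for i in range(3):
        forward, backward = list(r), list(r)
        forward[i] += h
        backward[i] -= h
        slope = (potential(forward, charges) - potential(backward, charges)) / (2 * h)
        result.append(-slope)
    return result

# Hypothetical configuration: two point charges, each given as (e_a, r_a).
charges = [(1.0, (0.0, 0.0, 0.0)), (-2.0, (1.0, 0.0, 0.0))]
point = (0.5, 2.0, -1.0)
print(field(point, charges))
print(minus_gradient(point, charges))
```

The two printed vectors agree to within the accuracy of the finite-difference step, as relation (41.5) requires.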


42. Poisson's Equation 

The general course of physics acquaints us with Gauss's theorem, which for a field in a vacuum can be worded as follows: the flux of the vector E through a closed surface is proportional to the algebraic sum of the charges confined within the surface. The proportionality constant depends on the choice of the system of units. In the Gaussian system usually employed in theoretical physics, it is $4\pi$. Hence,

$$\oint E_n\, df = 4\pi \sum_a e_a \tag{42.1}$$

If the distribution of charges inside the surface f is characterized with the aid of the charge density $\rho = \rho(\mathbf{r})$, Gauss's theorem can be written in the form

$$\oint_f E_n\, df = 4\pi \int_V \rho\, dV \tag{42.2}$$

where V is the volume confined by the surface f.

Applying the Ostrogradsky-Gauss theorem [see (XI.13)] to the left-hand side of formula (42.2), we arrive at the relation

$$\int_V \nabla \mathbf{E}\, dV = 4\pi \int_V \rho\, dV$$

The latter expression must be observed for any arbitrarily selected volume V. This is possible only if at every point of the field

$$\nabla \mathbf{E} = 4\pi\rho \tag{42.3}$$

Expressing the field strength in formula (42.3) through the potential according to (41.5), we obtain

$$\nabla(\nabla\varphi) = -4\pi\rho$$

It is shown in Appendix XI [see (XI.38)] that $\nabla(\nabla\varphi) = \Delta\varphi$, where $\Delta$ is the Laplacian (operator). We thus arrive at the relation

$$\Delta\varphi = -4\pi\rho \tag{42.4}$$

known as Poisson's equation. In the expanded form, this equation is as follows:

$$\frac{\partial^2\varphi}{\partial x^2} + \frac{\partial^2\varphi}{\partial y^2} + \frac{\partial^2\varphi}{\partial z^2} = -4\pi\rho \tag{42.5}$$

Poisson's equation allows us to find the potential at every point of a field according to a given distribution of the charges in space (according to the given function $\rho$). Knowing $\varphi$, we can determine E by formula (41.5).

For points of a field at which $\rho = 0$, Eq. (42.4) is

$$\Delta\varphi = 0 \tag{42.6}$$

Relation (42.6) is known as Laplace’s equation. 

In accordance with formula (41.10), the solution of Eq. (42.4) is

$$\varphi(\mathbf{r}) = \int \frac{\rho(\mathbf{r}')\, dV'}{|\mathbf{r} - \mathbf{r}'|} \tag{42.7}$$

where integration covers the entire region in which the charges setting up the field are distributed. This statement can be proved strictly mathematically by applying the Laplacian to the integral (42.7).

We must note that expression (42.7) also satisfies Poisson's equation if an arbitrary constant ($\Delta\, \text{const} = 0$) is added to it. Consequently, generally speaking, for the solution of Poisson's equation to be fully unique, the boundary conditions must be set. Solution (42.7) (with const = 0) is obtained when the potential at infinity is assumed to be zero.

The solution of Poisson's equation can be shown to be the only one with given boundary conditions. This proof, however, is beyond the scope of the present text.

We shall show that the function $\varphi(r) = 1/r$ satisfies Laplace's equation [Eq. (42.6)] at all points except r = 0. By (XI.51), we have $\nabla(1/r) = -(1/r^2)\, \mathbf{e}_r = -\mathbf{r}/r^3$. Applying the operator $\nabla$ once more to this expression, we obtain

$$\Delta\left(\frac{1}{r}\right) = \nabla\left(-\frac{\mathbf{r}}{r^3}\right) = -\frac{1}{r^3}\, (\nabla\mathbf{r}) - \mathbf{r}\, \nabla\left(\frac{1}{r^3}\right)$$

The divergence of r is 3 [see (XI.49)]. Consequently

$$\Delta\left(\frac{1}{r}\right) = -\frac{3}{r^3} - \mathbf{r}\left(-\frac{3}{r^4}\, \mathbf{e}_r\right) = -\frac{3}{r^3} + \frac{3}{r^3} = 0$$

Q.E.D.

We can find the values of $\Delta(1/r)$ for all points including r = 0. To do this, we shall take advantage of the fact that for a point charge e at the origin of coordinates $\rho = e\, \delta(\mathbf{r})$, and the potential $\varphi = e/r$. Introducing these values into Eq. (42.4) and cancelling e, we arrive at the following mathematical relation:

$$\Delta\, \frac{1}{r} = -4\pi\, \delta(\mathbf{r}) \tag{42.8}$$
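That 1/r satisfies Laplace's equation away from the origin can also be seen numerically. The following is a minimal sketch, assuming a second-order central-difference approximation of the Laplacian; the function names and the step h are illustrative choices, not part of the text.

```python
import math

def inv_r(x, y, z):
    """The function 1/r in Cartesian coordinates."""
    return 1.0 / math.sqrt(x * x + y * y + z * z)

def laplacian(f, point, h=1e-3):
    """Second-order central-difference estimate of the Laplacian of f."""
    x, y, z = point
    centre = f(x, y, z)
    total = 0.0
    for dx, dy, dz in ((h, 0.0, 0.0), (0.0, h, 0.0), (0.0, 0.0, h)):
        total += f(x + dx, y + dy, z + dz) + f(x - dx, y - dy, z - dz) - 2.0 * centre
    return total / (h * h)

# Away from r = 0 the estimate is zero up to discretization error.
print(laplacian(inv_r, (1.0, -0.5, 2.0)))
# For comparison, r^2 is not harmonic: its Laplacian equals 6.
print(laplacian(lambda x, y, z: x * x + y * y + z * z, (1.0, -0.5, 2.0)))
```

The singular behaviour at r = 0 expressed by (42.8) cannot be reproduced by such a pointwise estimate; the sketch only illustrates the statement for r ≠ 0.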


43. Expansion of a Field in Multipoles 

Let us consider a system of charges in a restricted volume and investigate the field set up by this system at distances that are great in comparison with the system's dimensions. We shall place the origin of coordinates inside the volume occupied by the system. This gives the following expression for the potential:

$$\varphi(\mathbf{r}) = \sum_a \frac{e_a}{|\mathbf{r} - \mathbf{r}_a|} \tag{43.1}$$

where according to our assumption $r \gg r_a$ [see (41.8); we have omitted the prime on $\mathbf{r}_a$].

We expand expression (43.1) into a power series in $r_a/r$. For this purpose, we express $\varphi(\mathbf{r})$ in terms of the components of the vectors r and $\mathbf{r}_a$:

$$\varphi(x_1, x_2, x_3) = \sum_a \frac{e_a}{\sqrt{\sum_i (x_i - x_{ai})^2}} \tag{43.2}$$

(the subscript i takes on the values 1, 2, 3, and the subscript a, the values 1, 2, ..., n, where n is the number of charges in the system). Considering the quantities $(-x_{ai})$ as small increments of the coordinates $x_i$, we can write the function (43.2) as

$$\varphi(x_1, x_2, x_3) = \sum_a \frac{e_a}{\sqrt{\sum_i x_i^2}} + \sum_a \sum_k e_a\, \frac{\partial}{\partial x_k}\left(\frac{1}{\sqrt{\sum_i x_i^2}}\right)(-x_{ak}) + \frac{1}{2} \sum_a \sum_{k,m} e_a\, \frac{\partial^2}{\partial x_k\, \partial x_m}\left(\frac{1}{\sqrt{\sum_i x_i^2}}\right)(-x_{ak})(-x_{am}) + \ldots = \varphi_0 + \varphi_1 + \varphi_2 + \ldots \tag{43.3}$$

[each addend of the sum (43.2) is expanded into a series].




Expression (43.3) is known as the expansion of the potential in multipoles. The first term of the expansion

$$\varphi_0 = \frac{\sum_a e_a}{r} \tag{43.4}$$

(we have substituted r for $\sqrt{\sum_i x_i^2}$) has the form of the potential of a point charge. The total charge $\sum e_a$ is a zero-order multipole (it is also called a monopole). When this multipole is non-zero, the term $\varphi_0$ makes the main contribution to potential (43.2).

To establish the form of a first-order multipole, let us transform the second term of the expansion (43.3) as follows:

$$\varphi_1 = \sum_a \sum_k e_a\, \frac{\partial}{\partial x_k}\left(\frac{1}{r}\right)(-x_{ak}) = -\sum_k \frac{\partial}{\partial x_k}\left(\frac{1}{r}\right) \sum_a e_a x_{ak}$$

The sum $\sum_a e_a x_{ak}$ is the projection onto the k-th coordinate axis of the vector

$$\mathbf{p} = \sum_a e_a \mathbf{r}_a \tag{43.5}$$

This vector is the dipole moment of the system of charges¹. It is a first-order multipole.

The expression

$$\frac{\partial}{\partial x_k}\left(\frac{1}{r}\right)$$

gives the k-th component of the gradient of 1/r. Hence,

$$\varphi_1 = -\sum_k \left(\nabla \frac{1}{r}\right)_k p_k = -\mathbf{p}\, \nabla \frac{1}{r} \tag{43.7}$$

The formula for $\varphi_1$ can be obtained directly in the vector form, taking advantage of the circumstance that by (XI.5) to within first-order terms, we have

$$f(\mathbf{r} + \delta\mathbf{r}) = f(\mathbf{r}) + \nabla f(\mathbf{r})\, \delta\mathbf{r}$$

Let us apply this formula to each addend in (43.1), considering $-\mathbf{r}_a$ as $\delta\mathbf{r}$. The result is

$$\varphi(\mathbf{r}) = \sum_a \frac{e_a}{r} + \sum_a e_a\, \nabla\left(\frac{1}{r}\right)(-\mathbf{r}_a) \tag{43.8}$$

The first term coincides with (43.4), and the second with (43.7).

We must note that $\nabla(1/r)$ is proportional to $1/r^2$. Consequently, the addends of the second sum in (43.8) are quantities of the order of $r_a/r$ relative to the addends of the first sum.

¹ If a charge is distributed over the volume of a system having the density $\rho$, the dipole moment is determined by the integral

$$\mathbf{p} = \int \rho(\mathbf{r})\, \mathbf{r}\, dV \tag{43.6}$$

It must be remembered that for a system whose total charge $\sum e_a$ equals zero, the dipole moment does not depend on the choice of the origin of coordinates. Indeed, transferring the origin of coordinates to a point for which r = b, we get the values $\mathbf{r}'_a = \mathbf{r}_a - \mathbf{b}$ for the position vectors from the new origin of coordinates. The dipole moment in the new system is $\mathbf{p}' = \sum e_a \mathbf{r}'_a = \sum e_a \mathbf{r}_a - \sum e_a \mathbf{b} = \mathbf{p} - \mathbf{b} \sum e_a$. Since $\sum e_a = 0$, we obtain p' = p.

Fig. 43.1. (A system of charges +e and -e forming a quadrupole.)

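After calculating $\nabla(1/r)$ and introducing it into (43.7), we arrive at an expression for the potential of the field of a dipole:

$$\varphi_1 = -\mathbf{p}\, \nabla \frac{1}{r} = -\mathbf{p}\left(-\frac{\mathbf{r}}{r^3}\right) = \frac{\mathbf{p}\mathbf{r}}{r^3} \tag{43.9}$$

Now let us find the field strength of a dipole:

$$\mathbf{E} = -\nabla\varphi_1 = -\nabla\left(\frac{\mathbf{p}\mathbf{r}}{r^3}\right) = -(\mathbf{p}\mathbf{r})\, \nabla \frac{1}{r^3} - \frac{1}{r^3}\, \nabla(\mathbf{p}\mathbf{r})$$

[see (XI.25)]. Using formulas (XI.51) and (XI.37), we obtain

$$\mathbf{E} = \frac{3(\mathbf{p}\mathbf{r})}{r^4}\, \mathbf{e}_r - \frac{1}{r^3}\left\{[\mathbf{p}, [\nabla\mathbf{r}]] + [\mathbf{r}, [\nabla\mathbf{p}]] + (\mathbf{r}\nabla)\, \mathbf{p} + (\mathbf{p}\nabla)\, \mathbf{r}\right\}$$

The curl of r equals zero [see (XI.50)], $[\nabla\mathbf{p}] = 0$ because p does not depend on r, and $(\mathbf{r}\nabla)\, \mathbf{p} = 0$ for the same reason. Hence, of the four terms in the braces, only the last one is non-zero, and by (XI.34) it equals p. Therefore,

$$\mathbf{E} = \frac{3(\mathbf{p}\mathbf{r})\, \mathbf{r}}{r^5} - \frac{\mathbf{p}}{r^3} = \frac{3\mathbf{e}_r (\mathbf{p}\mathbf{e}_r) - \mathbf{p}}{r^3} \tag{43.10}$$

where $\mathbf{e}_r$ is the unit vector of the position vector r.

We must note that the field of a dipole has axial symmetry relative to the direction of p.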
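Formula (43.10) can be compared with the exact field of two closely spaced opposite charges. The sketch below is an illustration under stated assumptions: the charges +e and -e are placed at z = ±d/2 on the z-axis (so that p = ed along z), Gaussian units are used, and the function names and numerical values are hypothetical.

```python
import math

def dipole_field(p, r):
    """E = (3 e_r (p.e_r) - p) / r^3, Eq. (43.10)."""
    rn = math.sqrt(sum(x * x for x in r))
    er = [x / rn for x in r]
    p_er = sum(p[i] * er[i] for i in range(3))
    return [(3.0 * er[i] * p_er - p[i]) / rn ** 3 for i in range(3)]

def two_charge_field(e, d, r):
    """Exact field of +e at (0, 0, d/2) and -e at (0, 0, -d/2), from (41.9)."""
    strength = [0.0, 0.0, 0.0]
    for charge, z in ((e, d / 2.0), (-e, -d / 2.0)):
        sep = [r[0], r[1], r[2] - z]
        dist3 = math.sqrt(sum(x * x for x in sep)) ** 3
        for i in range(3):
            strength[i] += charge * sep[i] / dist3
    return strength

e, d = 1.0e3, 1.0e-3  # illustrative values giving p = e * d = 1
point = (3.0, 4.0, 5.0)  # observation point with r much greater than d
print(two_charge_field(e, d, point))
print(dipole_field((0.0, 0.0, e * d), point))
```

At distances large compared with d the two results coincide closely; the remaining difference is of relative order $(d/r)^2$ for this symmetric arrangement.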

Not only the total charge $\sum e_a$, but also the dipole moment $\mathbf{p} = \sum e_a \mathbf{r}_a$ may equal zero. This occurs, for example, for the system of charges depicted in Fig. 43.1 and known as a quadrupole. Here, the field is determined by the next term of the expansion of the function (43.1), quadratic relative to the quantities $r_a/r$.

Let us write $\varphi_2$ as follows [see (43.3)]:

$$\varphi_2 = \frac{1}{2} \sum_a \sum_{k,m} e_a\, \frac{\partial^2}{\partial x_k\, \partial x_m}\left(\frac{1}{r}\right) x_{ak} x_{am} = \frac{1}{2} \sum_{k,m} \frac{\partial^2}{\partial x_k\, \partial x_m}\left(\frac{1}{r}\right)\left\{\sum_a e_a x_{ak} x_{am}\right\} \tag{43.11}$$

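The quantity in braces is the (k, m)-th component of a symmetrical tensor of rank two (see Appendix X). This tensor could be adopted as the corresponding multipole. But as we shall show below, of the nine components of this tensor not six are independent (as in a symmetrical tensor), but only five. To underline this, the tensor characterizing the property of the system that determines $\varphi_2$ is written in a different form. We saw in Sec. 42 that the function 1/r satisfies Laplace's equation, i.e.

$$\Delta \frac{1}{r} = \sum_k \frac{\partial^2}{\partial x_k^2}\, \frac{1}{r} = 0$$

(according to our assumption $r \gg r_a$, so that r = 0 is not considered). It can easily be seen that this formula can be written as

$$\sum_{k,m} \delta_{km}\, \frac{\partial^2}{\partial x_k\, \partial x_m}\, \frac{1}{r} = 0$$

Multiplying the expression we have obtained by $e_a r_a^2/6$ and then summating over a, we obtain

$$\frac{1}{6} \sum_a e_a r_a^2 \sum_{k,m} \delta_{km}\, \frac{\partial^2}{\partial x_k\, \partial x_m}\, \frac{1}{r} = 0$$

which can be written as follows:

$$\frac{1}{2} \sum_{k,m} \frac{\partial^2}{\partial x_k\, \partial x_m}\left(\frac{1}{r}\right)\left\{\sum_a e_a\, \frac{1}{3}\, r_a^2\, \delta_{km}\right\} = 0$$

Subtracting this expression from (43.11), we give the formula for $\varphi_2$ the form

$$\varphi_2 = \frac{1}{2} \sum_{k,m} \frac{\partial^2}{\partial x_k\, \partial x_m}\left(\frac{1}{r}\right)\left\{\sum_a e_a \left(x_{ak} x_{am} - \frac{1}{3}\, r_a^2\, \delta_{km}\right)\right\} \tag{43.12}$$

The set of quantities

$$Q_{km} = \sum_a e_a \left(3 x_{ak} x_{am} - r_a^2\, \delta_{km}\right) \tag{43.13}$$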


is called the tensor of the quadrupole moment of a system. Let us evaluate the trace of this tensor, i.e. the sum of its diagonal components:

$$\operatorname{Tr}(Q_{km}) = \sum_k Q_{kk} = \sum_k \sum_a e_a \left(3 x_{ak}^2 - r_a^2\right) = \sum_a e_a \sum_k \left(3 x_{ak}^2 - r_a^2\right) = \sum_a e_a \left(3 \sum_k x_{ak}^2 - 3 r_a^2\right) = 0 \tag{43.14}$$

The equality to zero of $\operatorname{Tr}(Q_{km})$ signifies that of the three diagonal components of the tensor $Q_{km}$ only two are independent, and, consequently, there are altogether five independent components.

If we transform the tensor $Q_{km}$ to the principal axes, owing to condition (43.14) only two of the three principal values are independent. If a system of charges has an axis of symmetry of an order higher than two, this axis (we shall designate it by the letter z) is one of the principal axes of the tensor $Q_{km}$; the position of the other two principal axes is arbitrary. In this case, it is evident that $Q_{xx} = Q_{yy}$, and owing to (43.14), we have

$$Q_{xx} = Q_{yy} = -\frac{1}{2}\, Q_{zz} \tag{43.15}$$

The principal value $Q_{zz}$ is called simply the quadrupole moment of the system in this case. It can be shown that when the total charge and the dipole moment of a system equal zero, the quadrupole moment does not depend on the choice of the origin of coordinates.
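Definition (43.13) and properties (43.14) and (43.15) are easy to verify for a concrete system. The sketch below uses an assumed example that is not taken from the text: a linear arrangement of charges +e at z = ±a and -2e at the origin, which has zero total charge and zero dipole moment. The function name `quadrupole_tensor` is hypothetical.

```python
def quadrupole_tensor(charges):
    """Q_km = sum_a e_a (3 x_ak x_am - r_a^2 delta_km), Eq. (43.13).

    `charges` is a list of (e_a, (x, y, z)) pairs.
    """
    q = [[0.0] * 3 for _ in range(3)]
    for e, ra in charges:
        ra2 = sum(x * x for x in ra)
        for k in range(3):
            for m in range(3):
                delta = 1.0 if k == m else 0.0
                q[k][m] += e * (3.0 * ra[k] * ra[m] - ra2 * delta)
    return q

# Assumed example: +e at z = +a and z = -a, -2e at the origin (e = a = 1).
linear = [(1.0, (0.0, 0.0, 1.0)), (1.0, (0.0, 0.0, -1.0)), (-2.0, (0.0, 0.0, 0.0))]
q = quadrupole_tensor(linear)
print(q)
print(q[0][0] + q[1][1] + q[2][2])  # the trace, which vanishes by (43.14)
```

For this system the z-axis is an axis of symmetry, the tensor is diagonal, and its diagonal components satisfy (43.15).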

Using the symbol (43.13), we can write the potential of a field 
due to a quadrupole as follows: 

krr < 43 - 16) 

ft, m 

Let us calculate the second derivatives in this expression: 

d i i_ _ d ! d _ 1 _ \ d_ / 1 _ dr \ 

dxhdx m t d^h \ dx m r / dx k \ r 2 dx m ) 

9 / x m \ §hm | 3xm x k 

dx h \ r 3 / r 3 '' r? 

(dx m /dx h = 6h m )- Introducing this value of the derivatives into 
(43.16), we have 

= ■^ 2 -( i ^ 5i ~fiftm) (43.17) 

ft. m 

Examination of (43.17) shows that the potential of a quadrupole diminishes with the distance as $1/r^3$. We remind our reader that the potential of a monopole diminishes according to the law $1/r$ [see (43.4)], and that of a dipole according to the law $1/r^2$ [see (43.9)]. In general, the potential of an n-th order multipole diminishes with the distance according to the law $1/r^{n+1}$.

Hence, the field of a system of charges can be represented as the superposition of the fields set up by multipoles of different orders:

$$\varphi(\mathbf{r}) = \varphi_0 + \varphi_1 + \varphi_2 + \ldots = \frac{\sum e_a}{r} - \mathbf{p} \nabla \frac{1}{r} + \frac{1}{6} \sum_{k,m} Q_{km} \frac{\partial^2}{\partial x_k\, \partial x_m} \frac{1}{r} + \ldots$$

We shall not deal with multipoles of higher orders. We shall only note that a multipole of the third order is called an octupole. An example of an octupole is a system of eight charges identical in magnitude and arranged at the corners of a cube so that the closest neighbours are charges of opposite signs.
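
The second-derivative formula used in (43.17) and the $1/r^3$ law can be checked numerically. The Python sketch below is only an illustration: the function names, the sample tensor (chosen traceless and axially symmetric, as in (43.15)), the test point, and the step h are assumptions made for the example and do not come from the text.

```python
import math


def inv_r(x):
    """Return 1/r for the point x = (x1, x2, x3)."""
    return 1.0 / math.sqrt(sum(c * c for c in x))


def second_derivative_numeric(x, k, m, h=1e-3):
    """Central-difference estimate of d^2(1/r)/(dx_k dx_m) at the point x."""
    def shifted(dk, dm):
        y = list(x)
        y[k] += dk
        y[m] += dm
        return inv_r(y)
    return (shifted(h, h) - shifted(h, -h)
            - shifted(-h, h) + shifted(-h, -h)) / (4.0 * h * h)


def second_derivative_closed(x, k, m):
    """Closed form used in (43.17): 3 x_k x_m / r^5 - delta_km / r^3."""
    r = math.sqrt(sum(c * c for c in x))
    delta = 1.0 if k == m else 0.0
    return 3.0 * x[k] * x[m] / r**5 - delta / r**3


def quadrupole_potential(Q, x):
    """phi_2 = (1/6) * sum over k, m of Q_km * d^2(1/r)/(dx_k dx_m)."""
    return sum(Q[k][m] * second_derivative_closed(x, k, m)
               for k in range(3) for m in range(3)) / 6.0


# Hypothetical sample data: a traceless tensor with Q_xx = Q_yy = -Q_zz/2.
Q_SAMPLE = [[-1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, 2.0]]
POINT = (0.7, -1.1, 1.6)

# Largest deviation between the numerical and the closed-form derivatives.
max_error = max(abs(second_derivative_numeric(POINT, k, m)
                    - second_derivative_closed(POINT, k, m))
                for k in range(3) for m in range(3))

# Ratio of the potentials at the doubled and at the original distance.
ratio = (quadrupole_potential(Q_SAMPLE, [2.0 * c for c in POINT])
         / quadrupole_potential(Q_SAMPLE, POINT))
print(max_error, ratio)
```

Doubling the distance along a fixed direction should reduce the quadrupole potential eightfold, which is what the ratio tests.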


44. Field in Dielectrics 

Up to now, we have been dealing with an electrostatic field produced by a given system of charges in a vacuum. It was assumed that the charges setting up the field can move over macroscopic distances (for instance, within the confines of the entire conducting body). We shall call such charges free.

Matters become much more complicated in a field produced by free charges in dielectrics. Here the field set up by the charges in the atoms and molecules of the dielectric is superposed onto the field of the free charges. Since these charges cannot leave the confines of the atoms and molecules they belong to, they are called bound.

If $\mathbf{E}_{\text{free}}$ is the field of the free charges, and $\mathbf{E}_{\text{bound}}$ is that of the bound ones, the strength of the resultant field $\mathbf{E}_{\text{res}}$ can be written as

$$\mathbf{E}_{\text{res}} = \mathbf{E}_{\text{free}} + \mathbf{E}_{\text{bound}} \qquad (44.1)$$

Even if the free charges are stationary, the field (44.1) is not stationary (i.e. time independent) because the bound charges are in motion inside the molecules and, in addition, participate together with the molecules in thermal motion. It is thus evident that the field $\mathbf{E}_{\text{bound}}$ is a fast-varying function of time. In addition, $\mathbf{E}_{\text{bound}}$ changes greatly in the space between two adjacent molecules. Both kinds of dependence (on the time and on the point in the space between molecules) vanish if we deal with a value of $\mathbf{E}_{\text{bound}}$ that has been averaged, first, over a time interval much longer than the period of intramolecular motion and thermal oscillations, and, second, over a volume considerably exceeding that of a molecule. Consequently, the field $\langle \mathbf{E}_{\text{bound}} \rangle$ is stationary. In addition, it changes smoothly within the limits of a volume including many molecules. The field $\langle \mathbf{E}_{\text{bound}} \rangle$ is called macroscopic, unlike the microscopic field $\mathbf{E}_{\text{bound}}$.

We shall call the macroscopic quantity

$$\mathbf{E} = \mathbf{E}_{\text{free}} + \langle \mathbf{E}_{\text{bound}} \rangle \qquad (44.2)$$

the field strength in a dielectric.

In the absence of an external field (i.e. a field of free charges), the field $\langle \mathbf{E}_{\text{bound}} \rangle$ usually equals zero. Under the action of an external field, the mean positions of bound charges are displaced the more, the stronger is the field acting on them. As a result, the field $\langle \mathbf{E}_{\text{bound}} \rangle$ becomes other than zero. In calculating the field (44.2), matters become complicated by the fact that the average displacement of the bound charges is determined not by the field $\mathbf{E}_{\text{free}}$, but by the resultant field $\mathbf{E}$ that includes $\langle \mathbf{E}_{\text{bound}} \rangle$.

It is customary practice to characterize the state of a dielectric by the dipole moment of a unit of volume of the dielectric, which is called the polarization and is designated by the symbol $\mathbf{P}$. It is evident that $\mathbf{P}$ can be determined as

$$\mathbf{P} = \frac{\sum \mathbf{p}_i}{\Delta V} \qquad (44.3)$$

where $\Delta V$ is an infinitely small volume¹, and $\mathbf{p}_i$ is the dipole moment of an individual molecule; summation is performed over all the molecules confined in the volume $\Delta V$.

Having determined $\mathbf{P}$ in this way, we have in essence performed the averaging mentioned above when discussing $\mathbf{E}_{\text{bound}}$ ($\mathbf{P}$ is a macroscopic quantity, and $\mathbf{p}_i$ is a microscopic one).

When an electric field acts on a dielectric, the bound charges 
become displaced (each remaining within the confines of its “own” 
molecule), the positive ones along the field, and the negative ones 
in the opposite direction. The result is the formation of bound 
charges on the surface of the dielectric. In addition, space bound 
charges may also appear. 

Let us find an expression for $\rho_{\text{bound}}$, the volume density of the bound charges. We mentally separate in a dielectric an infinitely small volume $\Delta V$. Assume that in the absence of a field the bound charges $e_a$ (where a = 1, 2, ...) confined in this volume are at points determined by the position vectors $\mathbf{r}_{a0}$. Since the dielectric is not polarized in the absence of a field, the expression $\sum e_a \mathbf{r}_{a0}$ is zero (the sum is taken over $\Delta V$). Indeed, to within the factor $1/\Delta V$, this sum coincides with the polarization $\mathbf{P}$, which vanishes in an unpolarized dielectric.


¹ By an infinitely small volume in physics is meant a volume containing a sufficiently large number of molecules to allow us to ignore the discreteness of the substance, and at the same time small enough for us to consider that macroscopic quantities such as $\mathbf{E}$ or $\mathbf{P}$ are constant within the confines of $\Delta V$.

Assume that when a field is switched on, the bound charges become displaced by the segments $\Delta \mathbf{r}_a$ (we must note that these segments are much smaller than the linear dimensions of the volume $\Delta V$). The result is the appearance of a polarization characterized by the vector

$$\mathbf{P} = \frac{1}{\Delta V} \sum_a e_a (\mathbf{r}_{a0} + \Delta \mathbf{r}_a) = \frac{1}{\Delta V} \sum_a e_a\, \Delta \mathbf{r}_a \qquad (44.4)$$


The bound charges in a dielectric can be divided into several groups, in each of which the magnitude of the charge $e_a$ and the displacement $\Delta \mathbf{r}_a$ are identical. Let us number such groups with the subscript $\beta$. Let $n_\beta$ stand for the number of charges of group $\beta$ per unit volume of the dielectric. Hence, the polarization can be written as follows:

$$\mathbf{P} = \frac{1}{\Delta V} \sum_\beta e_\beta\, \Delta \mathbf{r}_\beta\, n_\beta\, \Delta V = \sum_\beta n_\beta e_\beta\, \Delta \mathbf{r}_\beta \qquad (44.5)$$

($n_\beta \Delta V$ is the number of charges of group $\beta$ contained in the volume $\Delta V$).

We calculate the algebraic sum of the bound charges crossing the boundary of the volume $\Delta V$ when the field is switched on. Figure 44.1 shows an element $d\mathbf{f}$ of the surface confining $\Delta V$. The charges of the group bearing the number $\beta$ contained in an elementary volume of the size $\Delta \mathbf{r}_\beta\, d\mathbf{f}$ cross $d\mathbf{f}$ and emerge from or enter the volume $\Delta V$. They carry along with them the charge

$$n_\beta e_\beta\, \Delta \mathbf{r}_\beta\, d\mathbf{f} \qquad (44.6)$$

Fig. 44.1.

This expression is algebraic. Its sign depends on that of $e_\beta$ and on the sign of the scalar product $\Delta \mathbf{r}_\beta\, d\mathbf{f}$, i.e. on the direction of $\Delta \mathbf{r}_\beta$ relative to the outward normal to $d\mathbf{f}$ (in Fig. 44.1 it is denoted by $\mathbf{n}$).

Summing expression (44.6) over $\beta$, we obtain the total charge crossing $d\mathbf{f}$:

$$\sum_\beta n_\beta e_\beta\, \Delta \mathbf{r}_\beta\, d\mathbf{f} = \mathbf{P}\, d\mathbf{f} \qquad (44.7)$$

[see (44.5)].

Integrating expression (44.7) over the surface f, we find the total bound charge emerging from the volume $\Delta V$ when a field is switched on. When this happens, the volume $\Delta V$, which was previously neutral, acquires a bound charge:

$$q_{\text{bound}} = -\oint_f \mathbf{P}\, d\mathbf{f}$$

(the acquired charge equals the emerged one taken with the opposite sign). Using the Ostrogradsky-Gauss theorem, we obtain

$$q_{\text{bound}} = -\int_{\Delta V} \nabla \mathbf{P}\, dV = -\nabla \mathbf{P}\, \Delta V$$

Hence, the following expression is obtained for the density of bound charges:

$$\rho_{\text{bound}} = -\nabla \mathbf{P} \qquad (44.8)$$


Formula (44.8) will also help us to find the surface density of the bound charges $\sigma_{\text{bound}}$. For this purpose, let us consider a cylindrical volume confined between two infinitely close surfaces of area $\Delta f$ on different sides of the dielectric's surface (Fig. 44.2). The bound charge $q_{\text{bound}}$ confined in this volume can be represented either in the form $\int \rho_{\text{bound}}\, dV$, or in the form $\sigma_{\text{bound}}\, \Delta f$. Using formula (44.8) and the Ostrogradsky-Gauss theorem, we can write

$$q_{\text{bound}} = \sigma_{\text{bound}}\, \Delta f = -\int \nabla \mathbf{P}\, dV = -\oint P_{n'}\, df$$

where $\mathbf{n}'$ is an outward normal to the surface of the cylinder. The integral over the surface can be divided into three parts: an integral over the outer base of the cylinder that equals zero because $\mathbf{P} = 0$ outside the dielectric; an integral over the side surface of the cylinder that may be disregarded owing to this surface being infinitely small; and, finally, an integral over the inner base of the cylinder. Taken together with the minus sign in front of it, the latter equals $-P_{n'}\, \Delta f$, where $\mathbf{n}'$ is an inward normal to the surface of the dielectric. If instead of $\mathbf{n}'$ we take the outward normal $\mathbf{n}$, we must substitute $+P_n$ for $-P_{n'}$. We have thus arrived at the expression $\sigma_{\text{bound}}\, \Delta f = P_n\, \Delta f$, whence

$$\sigma_{\text{bound}} = P_n \qquad (44.9)$$

Here $P_n$ is the projection of the polarization $\mathbf{P}$ onto an outward normal to the surface of a dielectric.
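
Formulas (44.8) and (44.9) can be tried out on a simple case. The Python sketch below is an illustration under stated assumptions, not an example from the text: it assumes a ball of radius A with the radial polarization P = K r (both K and A are hypothetical values), evaluates the volume density numerically, and checks that the volume and surface bound charges add up to zero, as they must for a body that was neutral before polarization.

```python
import math

K = 0.25   # hypothetical proportionality constant in P = K r
A = 2.0    # hypothetical radius of the ball


def polarization(x):
    """Assumed polarization P = K r inside the ball."""
    return [K * c for c in x]


def divergence(field, x, h=1e-5):
    """Central-difference estimate of the divergence of a vector field at x."""
    total = 0.0
    for i in range(3):
        plus, minus = list(x), list(x)
        plus[i] += h
        minus[i] -= h
        total += (field(plus)[i] - field(minus)[i]) / (2.0 * h)
    return total


def bound_volume_density(x):
    """rho_bound = -div P, formula (44.8)."""
    return -divergence(polarization, x)


def bound_surface_density():
    """sigma_bound = P_n, formula (44.9); on the sphere r = A, P_n = K * A."""
    return K * A


def total_bound_charge():
    """Volume bound charge plus surface bound charge of the ball."""
    rho = -3.0 * K  # exact value of -div(K r)
    volume_charge = rho * (4.0 / 3.0) * math.pi * A**3
    surface_charge = bound_surface_density() * 4.0 * math.pi * A**2
    return volume_charge + surface_charge
```

For this polarization the volume density is the constant -3K, and the negative volume charge is exactly compensated by the positive charge on the surface.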

We must note that charges of the density determined by formu- 
las (44.8) and (44.9) are not imaginary, but quite real charges. 

Fig. 44.2.

Hence, in the presence of dielectrics, the field of bound surface [see (44.9)] and volume [see (44.8)] charges is superposed onto the field set up by the free charges (let their density be $\rho$). Consequently, the potential at a point determined by the position vector $\mathbf{r}$ is

$$\varphi(\mathbf{r}) = \int \frac{\rho(\mathbf{r}')\, dV'}{R} + \int \frac{P_n(\mathbf{r}')\, df'}{R} - \int \frac{\nabla' \mathbf{P}(\mathbf{r}')\, dV'}{R} \qquad (44.10)$$

where for brevity we have used the notation $R = |\mathbf{r} - \mathbf{r}'|$; $dV'$ is an elementary volume taken in the vicinity of the point $\mathbf{r}'$, and $df'$ is an element of the surface of the dielectric taken in the vicinity of the point $\mathbf{r}'$; the divergence of $\mathbf{P}(\mathbf{r}')$ is taken over the primed coordinates, therefore the operator $\nabla$ is primed [see (XI.52)]. The first integral is taken over the volume where $\rho$ is non-zero, the second integral is taken over all the surfaces confining the dielectric, and, finally, the third integral is taken over the entire volume of the dielectric.


45. Description of the Field in Dielectrics 

If $\nabla \mathbf{P}$ differs from zero, every elementary volume of a dielectric is equivalent to a point charge of the magnitude $-\nabla \mathbf{P}\, dV$ and makes the corresponding contribution to the macroscopic field $\mathbf{E}$. Therefore, when dielectrics are present, Eq. (42.3) must be written as

$$\nabla \mathbf{E} = 4\pi (\rho_{\text{free}} + \rho_{\text{bound}}) = 4\pi (\rho - \nabla \mathbf{P}) \qquad (45.1)$$

(by $\rho$ is meant the density of the free charges). The equation in this form is hardly suitable for finding $\mathbf{E}$ because it determines the latter not only in terms of the density of the free charges, but also through the nature of polarization of the dielectric. The polarization, in turn, is a function of $\mathbf{E}$.

It is easy to note that if we introduce the auxiliary quantity

$$\mathbf{D} = \mathbf{E} + 4\pi \mathbf{P} \qquad (45.2)$$

the following equation holds for it:

$$\nabla \mathbf{D} = 4\pi \rho \qquad (45.3)$$

that is, $\nabla \mathbf{D}$ is determined only by the density of the free charges.

The quantity $\mathbf{D}$ is called the electric displacement (or the electric induction).

A comparison of Eqs. (45.1) and (45.3) shows that the operation of finding $\mathbf{D}$ is much simpler than that of finding $\mathbf{E}$. There would be little use in the quantity $\mathbf{D}$, however, if it were not for the circumstance that in the majority of practically important cases $\mathbf{D}$ is proportional to $\mathbf{E}$. It is therefore possible to use a “roundabout manoeuvre”: instead of the main characteristic of a field $\mathbf{E}$, first the auxiliary quantity $\mathbf{D}$ is calculated, and then the transition is performed from $\mathbf{D}$ to $\mathbf{E}$.

The introduction of the quantity $\mathbf{D}$ is also expedient because many formulas written with the use of $\mathbf{D}$ are much simpler than they would be when expressed in terms of $\mathbf{E}$ and $\mathbf{P}$.

Experiments show that in many cases of practical interest, the polarization is proportional to the field strength:

$$\mathbf{P} = \chi \mathbf{E} \qquad (45.4)$$

(we are for the time being limiting ourselves to a treatment of isotropic dielectrics). The quantity $\chi$ is known as the electric susceptibility of a substance. It is always positive. Introducing (45.4) into (45.2), we find that

$$\mathbf{D} = \mathbf{E} + 4\pi \chi \mathbf{E} = \varepsilon \mathbf{E} \qquad (45.5)$$

where

$$\varepsilon = 1 + 4\pi \chi \qquad (45.6)$$

is the relative permittivity or simply the permittivity¹ of a substance (also sometimes called the dielectric constant).

Hence, $\mathbf{D}$ and $\mathbf{E}$ are often proportional to each other. This is exactly why it is expedient to introduce $\mathbf{D}$.

Proceeding from Eq. (45.3), it would seem possible for us to conclude that $\mathbf{D}$ is determined only by the density of the free charges. But this is not true. Equation (45.3) alone is not sufficient for determining $\mathbf{D}$. To understand this more clearly, let us recall how $\mathbf{E}$ is found in the absence of a dielectric. In addition to the equation $\nabla \mathbf{E} = 4\pi \rho$, we took advantage of the fact that $[\nabla \mathbf{E}] = 0$ and assumed that $\mathbf{E} = -\nabla \varphi$. Introducing this expression into the equation for the divergence, we arrive at Poisson’s equation: $\nabla^2 \varphi = -4\pi \rho$. Solution of this equation allows us to find $\varphi$, and then $\mathbf{E}$.

If we follow the same path in finding $\mathbf{D}$, in addition to (45.3) we must consider the equation

$$[\nabla \mathbf{D}] = [\nabla, \varepsilon \mathbf{E}] = [\nabla \varepsilon, \mathbf{E}] + \varepsilon [\nabla \mathbf{E}] = [\nabla \varepsilon, \mathbf{E}]$$

[we have used formula (XI.27) and taken into account that $[\nabla \mathbf{E}] = 0$]. This equation transforms into $[\nabla \mathbf{D}] = 0$ only when $\nabla \varepsilon = 0$, i.e. when the dielectric is homogeneous. In the general case, $[\nabla \mathbf{D}]$ depends on $\mathbf{E}$, i.e. in the long run on the bound charges.

Although $\mathbf{D}$ thus depends in general on the bound charges, the collection of equations

$$[\nabla \mathbf{E}] = 0, \qquad \nabla \mathbf{D} = 4\pi \rho, \qquad \mathbf{D} = \varepsilon \mathbf{E}$$

allows us to calculate $\mathbf{E}$ and $\mathbf{D}$ according to the known distribution of the free charges in space.

¹ The absolute permittivity of a medium $\varepsilon_a = \varepsilon_0 \varepsilon$ is introduced in electrical engineering. It is devoid of a physical meaning, however, and we shall not use it.

Using relation (45.4), we can determine the conditions in which $\rho_{\text{bound}}$ differs from zero. Let us introduce the value given by (45.4) into formula (44.8):

$$\rho_{\text{bound}} = -\nabla \mathbf{P} = -\nabla (\chi \mathbf{E}) = -\chi \nabla \mathbf{E} - \mathbf{E} \nabla \chi \qquad (45.7)$$

[see (XI.26)]. By (45.1), we have $\nabla \mathbf{E} = 4\pi (\rho_{\text{free}} + \rho_{\text{bound}})$. Substituting this value for $\nabla \mathbf{E}$ in formula (45.7), we get

$$\rho_{\text{bound}} = -4\pi \chi (\rho_{\text{free}} + \rho_{\text{bound}}) - \mathbf{E} \nabla \chi$$

Now let us solve the equation obtained for $\rho_{\text{bound}}$:

$$\rho_{\text{bound}} = -\frac{1}{\varepsilon} (4\pi \chi \rho_{\text{free}} + \mathbf{E} \nabla \chi)$$

Inspection of the relation we have found shows that $\rho_{\text{bound}}$ is non-zero, first, at the points where $\rho_{\text{free}}$ is non-zero, and, second, at the points where $\nabla \chi \neq 0$, i.e. at places where the dielectric is non-homogeneous. We must note that space bound charges do not appear in a homogeneously polarized dielectric ($\mathbf{P}$ = const).
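
As a worked special case (added here as an illustration, not taken from the text), let the dielectric be homogeneous, so that $\nabla \chi = 0$. Formula (45.7) together with (45.1) then gives $\rho_{\text{bound}} = -4\pi \chi (\rho_{\text{free}} + \rho_{\text{bound}})$, and with $4\pi \chi = \varepsilon - 1$ from (45.6) we find

$$\rho_{\text{bound}} = -\frac{\varepsilon - 1}{\varepsilon}\, \rho_{\text{free}}, \qquad \rho_{\text{free}} + \rho_{\text{bound}} = \frac{\rho_{\text{free}}}{\varepsilon}$$

In other words, under this assumption the bound charge sits where the free charge is and partly screens it, so that the total charge density is $1/\varepsilon$ of the free one.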

Let us consider the field in a homogeneous dielectric. Assume that in the absence of a dielectric, a given distribution of the free charges produces a field characterized by the strength $\mathbf{E}_0$ and the potential $\varphi_0$. We know $\mathbf{E}_0$ and $\varphi_0$ to be the solutions of the equations

$$\nabla \mathbf{E}_0 = 4\pi \rho \qquad (45.8)$$

$$\Delta \varphi_0 = -4\pi \rho \qquad (45.9)$$

[see (42.3) and (42.4)].

Now, without changing the arrangement of the free charges (i.e. $\rho$), let us fill the entire space in which the field is non-zero with a homogeneous ($\varepsilon$ = const) isotropic dielectric. The field strength will then become equal to $\mathbf{E}$, and the potential will be $\varphi$. Let us write equations for $\mathbf{E}$ and $\varphi$. According to (45.3)

$$\nabla \mathbf{D} = \nabla (\varepsilon \mathbf{E}) = 4\pi \rho \qquad (45.10)$$

Substituting $-\nabla \varphi$ for $\mathbf{E}$ in this equation and taking into account that $\varepsilon$ = const, we can write

$$-\nabla (\varepsilon \nabla \varphi) = -\nabla^2 (\varepsilon \varphi) = 4\pi \rho$$

or

$$\Delta (\varepsilon \varphi) = -4\pi \rho \qquad (45.11)$$

A comparison of (45.10) with (45.8) and (45.11) with (45.9) shows that the equation for $\varepsilon \mathbf{E}$ coincides with that for $\mathbf{E}_0$, and the equation for $\varepsilon \varphi$ with that for $\varphi_0$. It follows that the filling of a space in which a field is present with a homogeneous isotropic dielectric leads to both the field strength and the potential becoming equal to $1/\varepsilon$ of their initial values. In particular, for the field of a point charge placed in a homogeneous dielectric, we have

$$\mathbf{E} = \frac{e \mathbf{r}}{\varepsilon r^3}, \qquad \varphi = \frac{e}{\varepsilon r} \qquad (45.12)$$

It must be noted that by (45.11), Poisson’s equation (42.4) for a field in a homogeneous isotropic dielectric is as follows:

$$\Delta \varphi = -\frac{4\pi}{\varepsilon} \rho \qquad (45.13)$$

We shall indicate without a derivation (which can be found in many textbooks of general physics) the conditions which $\mathbf{E}$ and $\mathbf{D}$ must satisfy at the interface between two dielectrics:

$$\begin{aligned} E_{1\tau} &= E_{2\tau}, & \varepsilon_1 E_{1n} &= \varepsilon_2 E_{2n} \\ D_{1n} &= D_{2n}, & D_{1\tau}/\varepsilon_1 &= D_{2\tau}/\varepsilon_2 \end{aligned} \qquad (45.14)$$

(the subscripts $\tau$ and n denote the tangential and normal components of the relevant vector, respectively).

It is a very difficult task to find the field by solving Eqs. (42.4), (45.3), etc. in the general case.

In cases involving symmetry, it is possible to establish the form of the field without solving any equations. We shall show this using the following example. Assume that we have the plane boundary of two semi-infinite homogeneous and isotropic dielectrics with the permittivities $\varepsilon_1$ and $\varepsilon_2$. There is a point charge q in the first dielectric at the distance a from the boundary. We are to find the field in both dielectrics.

Fig. 45.1.

We shall form the field in the first dielectric from those of the point charge q and its mirror image, the imaginary charge q'. This assumption satisfies the main condition that the first dielectric contains only one source of $\mathbf{D}$, the charge q (q' is outside the first dielectric). We shall thus seek the potential in the first dielectric in the form

$$\varphi_1 = \frac{q}{\varepsilon_1 r} + \frac{q'}{\varepsilon_1 r'} \qquad (45.15)$$

(what r and r' are is clear from Fig. 45.1).

We shall represent the field in the second dielectric as that of the imaginary charge q'' placed where q is. This assumption agrees with the fact that there are no sources of $\mathbf{D}$ in the second dielectric (q'' is outside the second dielectric). We shall thus seek the potential in the second dielectric in the form

$$\varphi_2 = \frac{q''}{\varepsilon_2 r} \qquad (45.16)$$

Let us try to choose the values of q' and q'' so as to satisfy the boundary conditions (45.14) for $\mathbf{D}$. According to our assumptions, $\mathbf{D}$ has the following values in the first and second dielectrics:

$$\mathbf{D}_1 = \frac{q \mathbf{r}}{r^3} + \frac{q' \mathbf{r}'}{r'^3}, \qquad \mathbf{D}_2 = \frac{q'' \mathbf{r}}{r^3} \qquad (45.17)$$

At the boundary of the dielectrics, $|\mathbf{r}| = |\mathbf{r}'|$. Consequently,

$$\mathbf{D}_{1b} = \frac{q \mathbf{r} + q' \mathbf{r}'}{r^3}, \qquad \mathbf{D}_{2b} = \frac{q'' \mathbf{r}}{r^3} \qquad (45.18)$$

Let us find the normal components of the vectors in (45.18). We direct the vector $\mathbf{n}$ from the first dielectric to the second one. Taking into account that the projection of $\mathbf{r}$ onto $\mathbf{n}$ equals a, and the projection of $\mathbf{r}'$ equals -a, we get

$$D_{1n} = \frac{q - q'}{r^3}\, a, \qquad D_{2n} = \frac{q''}{r^3}\, a$$



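Since the tangential components of the vectors $\mathbf{r}$ and $\mathbf{r}'$ are the same, i.e. $r_\tau = r'_\tau$, we can write

$$D_{1\tau} = \frac{q + q'}{r^3}\, r_\tau, \qquad D_{2\tau} = \frac{q''}{r^3}\, r_\tau \qquad (45.19)$$

It follows from the equality $D_{1n} = D_{2n}$ that

$$q - q' = q'' \qquad (45.20)$$

After introducing the values from (45.19) into the relation $D_{1\tau}/\varepsilon_1 = D_{2\tau}/\varepsilon_2$ [see (45.14)], we obtain

$$q + q' = \frac{\varepsilon_1}{\varepsilon_2}\, q'' \qquad (45.21)$$

The simultaneous solution of Eqs. (45.20) and (45.21) yields the values of q' and q'' satisfying the boundary conditions:

$$q' = q\, \frac{\varepsilon_1 - \varepsilon_2}{\varepsilon_1 + \varepsilon_2}, \qquad q'' = q\, \frac{2 \varepsilon_2}{\varepsilon_1 + \varepsilon_2} \qquad (45.22)$$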


We have succeeded in “constructing” the functions $\mathbf{D}_1$ and $\mathbf{D}_2$ [see (45.17)], each of which satisfies the equation $\nabla \mathbf{D} = 4\pi \rho$ in its region. In addition, these functions satisfy the boundary conditions. Therefore, the functions (45.17) and, accordingly, (45.15) and (45.16) are the solutions of the problem [after introducing the values (45.22) for q' and q'' into them]. According to the uniqueness theorem (see Sec. 42), there are no other solutions.
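
The image-charge solution can be checked against the boundary conditions (45.14) numerically. The Python sketch below is illustrative only and rests on assumptions that are not in the text: the boundary is taken as the plane z = 0, the first dielectric occupies z < 0, the charge q sits at (0, 0, -a), and the function names and sample permittivities are hypothetical.

```python
def image_charges(q, eps1, eps2):
    """Return (q', q'') as given by (45.22)."""
    q_image = q * (eps1 - eps2) / (eps1 + eps2)
    q_eff = q * 2.0 * eps2 / (eps1 + eps2)
    return q_image, q_eff


def coulomb_d(charge, source_z, point):
    """D of a point source at (0, 0, source_z): charge * r / r^3."""
    dx, dy, dz = point[0], point[1], point[2] - source_z
    r3 = (dx * dx + dy * dy + dz * dz) ** 1.5
    return [charge * dx / r3, charge * dy / r3, charge * dz / r3]


def d_first(point, q, a, eps1, eps2):
    """D_1 of (45.17): real charge at z = -a plus image q' at z = +a."""
    q_image, _ = image_charges(q, eps1, eps2)
    real = coulomb_d(q, -a, point)
    image = coulomb_d(q_image, a, point)
    return [u + v for u, v in zip(real, image)]


def d_second(point, q, a, eps1, eps2):
    """D_2 of (45.17): fictitious charge q'' placed where q is."""
    _, q_eff = image_charges(q, eps1, eps2)
    return coulomb_d(q_eff, -a, point)


def boundary_mismatch(point_xy, q=1.0, a=1.0, eps1=2.0, eps2=5.0):
    """Return the jumps of D_n and of E_tau (x-component) on the plane z = 0."""
    point = (point_xy[0], point_xy[1], 0.0)
    d1 = d_first(point, q, a, eps1, eps2)
    d2 = d_second(point, q, a, eps1, eps2)
    normal_jump = d1[2] - d2[2]
    tangential_jump = d1[0] / eps1 - d2[0] / eps2
    return normal_jump, tangential_jump
```

Both jumps should vanish (up to rounding) at any point of the boundary; replacing (45.22) by other values of q' and q'' makes at least one of them non-zero.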
46. Field in Anisotropic Dielectrics

In anisotropic dielectrics, the directions of the vectors $\mathbf{P}$ and $\mathbf{E}$, generally speaking, do not coincide (see Appendix X). The relation between the components of these vectors is given by the expressions

$$P_i = \sum_{k=1}^{3} \chi_{ik} E_k \quad (i = 1, 2, 3) \qquad (46.1)$$

where $\chi_{ik}$ is a symmetric tensor of rank two known as the electric susceptibility tensor.

By (45.2), we have $\mathbf{D} = \mathbf{E} + 4\pi \mathbf{P}$ or, in components,

$$D_i = E_i + 4\pi P_i$$

Introducing $P_i$ from (46.1), we get

$$D_i = E_i + 4\pi \sum_k \chi_{ik} E_k$$

If we represent $E_i$ in the form $\sum_k \delta_{ik} E_k$, we can write

$$D_i = \sum_k \delta_{ik} E_k + 4\pi \sum_k \chi_{ik} E_k = \sum_k (\delta_{ik} + 4\pi \chi_{ik}) E_k$$

The quantities

$$\varepsilon_{ik} = \delta_{ik} + 4\pi \chi_{ik} \qquad (46.2)$$

are clearly the components of a symmetric tensor of rank two. It is called the permittivity tensor. With its aid, the relation between the vectors $\mathbf{D}$ and $\mathbf{E}$ can be written as

$$D_i = \sum_k \varepsilon_{ik} E_k \quad (i = 1, 2, 3) \qquad (46.3)$$

The symmetric tensor has six independent components. If it is reduced to the principal axes, it appears as follows:

$$(\varepsilon_{ik}) = \begin{pmatrix} \varepsilon_1 & 0 & 0 \\ 0 & \varepsilon_2 & 0 \\ 0 & 0 & \varepsilon_3 \end{pmatrix}$$

We must note that since the principal values of the tensor $\chi_{ik}$ are positive, those of the tensor $\varepsilon_{ik}$ are always greater than unity.

In crystals of the triclinic, monoclinic, and rhombic systems, all three principal values of the tensor $\varepsilon_{ik}$ and, consequently, the semiaxes of the tensor ellipsoid are different. Such crystals are known as biaxial ones.

In crystals of the tetragonal, rhombohedral, and hexagonal systems, two principal values coincide: $\varepsilon_1 = \varepsilon_2 \neq \varepsilon_3$. The tensor ellipsoid in this case is an ellipsoid of revolution. Such crystals are known as uniaxial ones.

In crystals of the cubic system, all three principal values of the tensor $\varepsilon_{ik}$ are the same so that the tensor has the form $\varepsilon \delta_{ik}$. The tensor ellipsoid in this case degenerates into a sphere. Such crystals do not differ from isotropic bodies in their dielectric (and optical) properties.
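
Relations (46.2) and (46.3) are easy to try out numerically. The Python sketch below is a hedged illustration: the function names and the principal values of the sample uniaxial tensor are hypothetical, and the tensor is assumed to be already written in its principal axes.

```python
import math


def permittivity_tensor(chi):
    """epsilon_ik = delta_ik + 4*pi*chi_ik, formula (46.2)."""
    return [[(1.0 if i == k else 0.0) + 4.0 * math.pi * chi[i][k]
             for k in range(3)] for i in range(3)]


def displacement(eps, field):
    """D_i = sum over k of epsilon_ik * E_k, formula (46.3)."""
    return [sum(eps[i][k] * field[k] for k in range(3)) for i in range(3)]


def cross(a, b):
    """Vector product [ab]; it vanishes when a and b are parallel."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


# Hypothetical uniaxial crystal in its principal axes: eps_1 = eps_2 != eps_3.
EPS_UNIAXIAL = [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 5.0]]

# A field oblique to the axes gives D that is not parallel to E ...
d_oblique = displacement(EPS_UNIAXIAL, [1.0, 0.0, 1.0])
# ... while a field along a principal axis gives D parallel to E.
d_axial = displacement(EPS_UNIAXIAL, [0.0, 0.0, 1.0])
```

A non-zero vector product of E and D for the oblique field shows directly that the two vectors have different directions in an anisotropic dielectric.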




Chapter IX 


MAGNETOSTATICS 


47. Stationary Magnetic Field in a Vacuum 

A point charge e moving at the velocity $\mathbf{v}$ experiences in a magnetic field the force¹

$$\mathbf{F} = \frac{e}{c} [\mathbf{v} \mathbf{B}] \qquad (47.1)$$

(c is the speed of light in a vacuum). The vector quantity $\mathbf{B}$ called the magnetic induction is the basic (strength) characteristic of a magnetic field. Relation (47.1) may be considered as the definition of the quantity $\mathbf{B}$.

Owing to the absence in nature of magnetic charges similar to the electric charges e,² the lines of the vector $\mathbf{B}$ have neither a beginning nor an end. This is why the flux of the vector $\mathbf{B}$ through any closed surface always equals zero:

$$\oint_f \mathbf{B}\, d\mathbf{f} = 0 \qquad (47.2)$$

Formula (47.2) is an analytical expression of Gauss’s theorem for the magnetic induction vector.

Using the Ostrogradsky-Gauss theorem, expression (47.2) can be written as

$$\int_V \nabla \mathbf{B}\, dV = 0 \qquad (47.3)$$

Condition (47.3) must be observed for any arbitrarily chosen volume V. This is possible only if the integrand is zero at each point. We thus arrive at the conclusion that the divergence of the magnetic induction vector is zero at every point of a field:

$$\nabla \mathbf{B} = 0 \qquad (47.4)$$


¹ The sum of the forces (41.1) and (47.1) is called the Lorentz force.

² Proceeding from the fact that the equations of physics in general and those of electrodynamics in particular must be symmetric, Paul Dirac advanced the hypothesis that magnetic charges (Dirac’s monopoles) ought to exist in nature. Searches for these charges have so far given no results, so that the question of the existence of Dirac’s monopoles remains open.

It is proved in vector analysis [see (XI.44)] that the divergence of the curl of a vector function always equals zero. Therefore, the magnetic induction can be represented as the curl of a certain vector function $\mathbf{A}$:

$$\mathbf{B} = [\nabla \mathbf{A}] \qquad (47.5)$$

The function $\mathbf{A}$ is called the vector potential of a magnetic field.

The vector potential, like its scalar counterpart $\varphi$, is determined non-uniquely. Indeed, since the curl of the gradient of any function is zero [see (XI.43)], the addition to the vector potential of the quantity $\nabla \psi$ (here $\psi$ is an arbitrary function) does not change the value of $[\nabla \mathbf{A}]$, i.e. $\mathbf{B}$. Hence, if $\mathbf{A}$ is the vector potential corresponding to a given magnetic field, the function

$$\mathbf{A}' = \mathbf{A} + \nabla \psi \qquad (47.6)$$

is also a vector potential of this field.

The property (47.6) allows us to choose the potential in the most convenient way, for instance, to impose definite conditions on the divergence of $\mathbf{A}$. Indeed, it can be seen from (47.6) that

$$\nabla \mathbf{A}' = \nabla \mathbf{A} + \nabla (\nabla \psi) = \nabla \mathbf{A} + \Delta \psi$$

so that the proper choice of $\psi$ can give $\nabla \mathbf{A}'$ any preset value. Within the scope of magnetostatics, we shall choose $\psi$ so that

$$\nabla \mathbf{A} = 0 \qquad (47.7)$$


To illustrate what has been said above, let us consider the vector potential of the homogeneous magnetic field $\mathbf{B}$ = const = $\mathbf{B}_0$. Let us direct the z-axis along $\mathbf{B}_0$. Hence $B_x = B_y = 0$, $B_z = B_0$, and Eq. (47.5) written in components becomes

$$\begin{aligned} B_x &= \frac{\partial A_z}{\partial y} - \frac{\partial A_y}{\partial z} = 0, \\ B_y &= \frac{\partial A_x}{\partial z} - \frac{\partial A_z}{\partial x} = 0, \\ B_z &= \frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y} = B_0 \end{aligned} \qquad (47.8)$$

It can be seen that these equations are satisfied, say, by the following value of the potential:

$$A_x = -B_0 y, \qquad A_y = 0, \qquad A_z = 0 \qquad (47.9)$$

Figure 47.1a depicts the lines of the vector $\mathbf{A}$ having the components (47.9).

Solution (47.9) is not the only one. Equations (47.8) are also satisfied by the following potential:

$$A_x = 0, \qquad A_y = B_0 x, \qquad A_z = 0 \qquad (47.10)$$

The lines of $\mathbf{A}$ for this case are shown in Fig. 47.1b. It is evident that

$$A_x = -\alpha B_0 y, \qquad A_y = (1 - \alpha) B_0 x, \qquad A_z = 0$$

where $\alpha$ is any number, will also be a solution. In particular, Eqs. (47.8) are satisfied by

$$A_x = -\tfrac{1}{2} B_0 y, \qquad A_y = \tfrac{1}{2} B_0 x, \qquad A_z = 0 \qquad (47.11)$$

Solution (47.11) can be represented as

$$A_x = \tfrac{1}{2} (B_y z - B_z y), \qquad A_y = \tfrac{1}{2} (B_z x - B_x z), \qquad A_z = \tfrac{1}{2} (B_x y - B_y x)$$

(remember that $B_x = B_y = 0$), whence

$$\mathbf{A} = \tfrac{1}{2} [\mathbf{B} \mathbf{r}] = \tfrac{1}{2} [\mathbf{B}_0 \mathbf{R}] \qquad (47.12)$$

where $\mathbf{R}$ is the component of the position vector $\mathbf{r}$ perpendicular to the z-axis. The lines of the vector $\mathbf{A}$ corresponding to (47.12) are shown in Fig. 47.1c.

Fig. 47.1.

All the values of the potential we have given satisfy condition (47.7). We thus conclude that Eqs. (47.5) and (47.7) do not completely determine $\mathbf{A}$. For its determination to be unique, we must set the boundary conditions for it.
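
That the potentials (47.9), (47.10) and (47.11) all describe the same field and all satisfy (47.7) can be confirmed with a short numerical check. The Python sketch below is illustrative; the function names, the value of B_0 and the test point are assumptions made for the example.

```python
def vector_potential(alpha, b0):
    """Family A = (-alpha*B0*y, (1 - alpha)*B0*x, 0) considered in the text."""
    def field(p):
        x, y, _ = p
        return [-alpha * b0 * y, (1.0 - alpha) * b0 * x, 0.0]
    return field


def partial(field, p, component, axis, h=1e-3):
    """Central-difference derivative of one component along one axis."""
    plus, minus = list(p), list(p)
    plus[axis] += h
    minus[axis] -= h
    return (field(plus)[component] - field(minus)[component]) / (2.0 * h)


def curl(field, p):
    """Components of [nabla A] as written out in (47.8)."""
    return [partial(field, p, 2, 1) - partial(field, p, 1, 2),
            partial(field, p, 0, 2) - partial(field, p, 2, 0),
            partial(field, p, 1, 0) - partial(field, p, 0, 1)]


def divergence(field, p):
    """Divergence of A, which condition (47.7) requires to vanish."""
    return sum(partial(field, p, i, i) for i in range(3))


B0 = 3.0                      # hypothetical field strength
POINT = (0.4, -1.2, 2.0)      # hypothetical test point

# alpha = 1 is (47.9), alpha = 0 is (47.10), alpha = 1/2 is (47.11).
results = {alpha: (curl(vector_potential(alpha, B0), POINT),
                   divergence(vector_potential(alpha, B0), POINT))
           for alpha in (1.0, 0.0, 0.5)}
```

Each entry of `results` should give the curl (0, 0, B0) and zero divergence, which illustrates why (47.5) and (47.7) alone do not fix the potential.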

48. Poisson's Equation for the Vector Potential 

It is known from the general course of physics that the circulation of the vector $\mathbf{B}$ around any closed contour $\Gamma$ taken in a stationary magnetic field (in a field of steady currents) is proportional to the algebraic sum of the currents enclosed by the contour:

$$\oint_\Gamma B_l\, dl = \frac{4\pi}{c} \sum i \qquad (48.1)$$

We shall treat formula (48.1) as a relation established experimentally.

Introducing the current density $\mathbf{j}$, we can represent the sum of the currents as the flux of the vector $\mathbf{j}$ through the surface confined by the contour $\Gamma$. Formula (48.1) thus becomes

$$\oint_\Gamma B_l\, dl = \frac{4\pi}{c} \int_f \mathbf{j}\, d\mathbf{f}$$

Let us transform the left-hand side of the relation we have obtained according to Stokes’ theorem. The result is

$$\int_f [\nabla \mathbf{B}]\, d\mathbf{f} = \frac{4\pi}{c} \int_f \mathbf{j}\, d\mathbf{f} \qquad (48.2)$$

Assume that integration on the left-hand and right-hand sides is performed over the same surface [although Eq. (48.2) is also observed for different surfaces provided they rest on the same contour $\Gamma$]. Relation (48.2) must be observed for any arbitrarily taken surface f. This is only possible if at every point of the field

$$[\nabla \mathbf{B}] = \frac{4\pi}{c}\, \mathbf{j} \qquad (48.3)$$

(we remind our reader that we are considering the field of steady currents).

Equation (48.3) plays the same basic role in magnetostatics as Eq. (42.3) does in electrostatics. Together with Eq. (47.4), it allows us to calculate the field of preset stationary currents.

Let us introduce the curl of $\mathbf{A}$ [see (47.5)] instead of $\mathbf{B}$ into formula (48.3):

$$[\nabla, [\nabla \mathbf{A}]] = \frac{4\pi}{c}\, \mathbf{j}$$

According to (XI.45), we have $[\nabla, [\nabla \mathbf{A}]] = \nabla (\nabla \mathbf{A}) - \Delta \mathbf{A}$. Assuming, as we agreed upon [see (47.7)], that $\nabla \mathbf{A} = 0$, we get the following differential equation for $\mathbf{A}$:

$$\Delta \mathbf{A} = -\frac{4\pi}{c}\, \mathbf{j} \qquad (48.4)$$

This vector equation is equivalent to three scalar equations:

$$\Delta A_k = -\frac{4\pi}{c}\, j_k \quad (k = x, y, z) \qquad (48.5)$$

each of which is similar to Poisson’s equation for $\varphi$ [see (42.4)].

Equations (42.4) and (48.5) are equivalent from the mathematical viewpoint. Consequently, substituting $A_k$ for $\varphi$ and $j_k/c$ for $\rho$ in the solution of Eq. (42.4), we get the solution of Eq. (48.5). With a view to formula (42.7), we obtain

$$A_k = \frac{1}{c} \int \frac{j_k(\mathbf{r}')\, dV'}{|\mathbf{r} - \mathbf{r}'|} \quad (k = x, y, z) \qquad (48.6)$$

where the integral is evaluated over the entire region in which the currents producing the field flow.

The three expressions (48.6) can be combined into a single vector one:

$$\mathbf{A} = \frac{1}{c} \int \frac{\mathbf{j}(\mathbf{r}')\, dV'}{|\mathbf{r} - \mathbf{r}'|} \qquad (48.7)$$


Formula (48.7) allows us to calculate the vector potential of the field set up by currents according to their known distribution in space. Let us consider the field of a line (straight) current as an example. It is general knowledge that the potential of the electric field produced by a thin infinitely long uniformly charged filament can be written as¹ (we are considering the field outside the filament)

$$\varphi(R) = -2\lambda \ln \frac{R}{R_0} = -2\rho\sigma \ln \frac{R}{R_0} \qquad (48.8)$$

where R is the distance from the filament, $R_0$ is the distance to points whose potential is taken as zero (in the given case we cannot assume that $\varphi = 0$ at infinity because with such normalization the potential is infinitely great at finite values of R), and $\lambda$ is the linear density of the charge which, assuming the latter to be uniformly distributed over the entire cross section of the filament, can be written as $\rho\sigma$ (here $\sigma$ is the cross-sectional area of the filament).

Now assume that we have a thin infinitely long straight wire through which a current of density j flows that is uniformly distributed over the cross section $\sigma$. Directing the z-axis along the wire, we have $j_x = j_y = 0$, $j_z = j$. On the grounds that Eqs. (42.4) and (48.5) are identical, we can obtain an expression for $A_z$ by substituting $j_z/c = j/c$ for $\rho$ in (48.8):

$$A_z = -\frac{2 j \sigma}{c} \ln \frac{R}{R_0} = -\frac{2i}{c} \ln \frac{R}{R_0} \qquad (48.9)$$

where $i = j\sigma$ is the current flowing through the wire. The introduction into (48.8) of $j_x = 0$ and $j_y = 0$ instead of $\rho$ yields zero values for the components $A_x$ and $A_y$. Hence, the vector $\mathbf{A}$ can be written as

$$\mathbf{A} = -\frac{2i}{c} \ln \frac{R}{R_0}\, \mathbf{e}_z \qquad (48.10)$$

where $\mathbf{e}_z$ is the unit vector of the z-axis.

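¹ It is a simple matter to find that $E(R) = 2\lambda/R$ with the aid of Gauss’s theorem. Consequently, $d\varphi/dR = -2\lambda/R$. Integration leads to formula (48.8).

Taking the curl of expression (48.10), we find $\mathbf{B}$:

$$\mathbf{B} = [\nabla \mathbf{A}] = -\frac{2i}{c} \left[ \nabla, \left( \ln \frac{R}{R_0}\, \mathbf{e}_z \right) \right] = -\frac{2i}{c} \left[ \nabla \left( \ln \frac{R}{R_0} \right), \mathbf{e}_z \right] - \frac{2i}{c} \ln \frac{R}{R_0}\, [\nabla \mathbf{e}_z]$$

[we have used formula (XI.27)]. Since $\mathbf{e}_z$ = const, the second term vanishes. The gradient of the function $\ln (R/R_0)$ equals $\mathbf{R}/R^2$. Therefore,

$$\mathbf{B} = -\frac{2i}{c} \left[ \frac{\mathbf{R}}{R^2}, \mathbf{e}_z \right] = \frac{2i}{c R^2} [\mathbf{e}_z \mathbf{R}] \qquad (48.11)$$

According to the result we have obtained, the vector $\mathbf{B}$ at every point is in a plane perpendicular to the wire and directed along a tangent to the circle encompassing the wire. The magnitude of $\mathbf{B}$ is

$$B = \frac{2i}{cR}$$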
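
The step from (48.10) to (48.11) can be checked numerically: the curl of the potential should reproduce $(2i/cR^2)[\mathbf{e}_z \mathbf{R}]$, whose magnitude is $2i/(cR)$. The Python sketch below is illustrative only; the function names, the test point, the sample current and the choice $R_0 = 1$ are assumptions made for the example.

```python
import math

C = 2.99792458e10  # speed of light in cm/s (Gaussian units)


def vector_potential(p, current, r0=1.0):
    """A = -(2i/c) ln(R/R0) e_z, formula (48.10); R is the distance from the z-axis."""
    big_r = math.hypot(p[0], p[1])
    return [0.0, 0.0, -(2.0 * current / C) * math.log(big_r / r0)]


def curl_of_az(p, current, h=1e-5):
    """Central-difference curl of a potential that has only a z-component."""
    def a_z(x, y):
        return vector_potential((x, y, p[2]), current)[2]
    d_dx = (a_z(p[0] + h, p[1]) - a_z(p[0] - h, p[1])) / (2.0 * h)
    d_dy = (a_z(p[0], p[1] + h) - a_z(p[0], p[1] - h)) / (2.0 * h)
    # [nabla A]_x = dA_z/dy, [nabla A]_y = -dA_z/dx, [nabla A]_z = 0.
    return [d_dy, -d_dx, 0.0]


def field_closed_form(p, current):
    """B = (2i / (c R^2)) [e_z R], formula (48.11), with [e_z R] = (-y, x, 0)."""
    r2 = p[0] ** 2 + p[1] ** 2
    factor = 2.0 * current / (C * r2)
    return [-factor * p[1], factor * p[0], 0.0]


POINT = (3.0, 4.0, 0.5)   # hypothetical test point, R = 5
CURRENT = 1.5 * C         # hypothetical current chosen so that 2i/c = 3

b_numeric = curl_of_az(POINT, CURRENT)
b_exact = field_closed_form(POINT, CURRENT)
```

The exact field is perpendicular to both the wire and $\mathbf{R}$, so its scalar product with (x, y, 0) is zero, in agreement with the statement that $\mathbf{B}$ is tangent to a circle around the wire.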



49. Field of Solenoid 

Let us calculate the field of an infinite solenoid, representing it as an infinite cylinder of radius a in whose surface layer of thickness b (where $b \ll a$) a current of density j flows (here bj is equivalent to ni, where i is the current in the solenoid, and n is the number of its turns per unit length).

We choose rectangular coordinate axes and make the z-axis coincide with the geometrical axis of the solenoid (Fig. 49.1). The projections of the vector $\mathbf{j}$ onto the coordinate axes are

$$j_x = -j \sin \alpha = -j\, \frac{y}{a}, \qquad j_y = j \cos \alpha = j\, \frac{x}{a}, \qquad j_z = 0 \qquad (49.1)$$

It immediately follows from $j_z = 0$ that $A_z = 0$. In accordance with what was said in Sec. 48, the component $A_x$ coincides with the potential $\varphi$ produced by the charge distributed in the surface layer of the cylinder with the density $\rho_x = j_x/c = -(j/c)(y/a) = -\rho_0 (y/a)$. We have introduced the symbol

$$\rho_0 = \frac{j}{c} \qquad (49.2)$$

Similarly, the component $A_y$ coincides with the potential of the charge distributed in the surface layer with the density $\rho_y = j_y/c = \rho_0 (x/a)$.

The density, which changes according to the law $-y/a$, can be obtained as follows. Let us insert a cylinder charged uniformly with the volume density $+\rho_0$ into another cylinder charged uniformly with the volume density $-\rho_0$. As a whole, the system of two cylinders is neutral at every point. Now let us move the negatively charged cylinder over the length b/2 in the direction of the y-axis, and the positively charged cylinder over the same distance in the opposite direction (Fig. 49.2a). Since $b \ll a$ (here a is the radius of the cylinders), the formed system may be considered as a cylinder in whose surface layer of thickness b a charge having the varying density $\rho_x$ is distributed. Actually, the density of the charge in the surface layer is constant and equals $\rho_0$ (or $-\rho_0$), while what does change is the thickness of the uncompensated layer. A glance at Fig. 49.2a shows that this thickness varies according to the law $b \sin \alpha$ so that the charge per unit of cylinder surface is $-\rho_0 b \sin \alpha = -\rho_0 b y/a$. Our error will not be noticeable, however, if we consider the thickness of the charged layer to be the same everywhere and equal to b, and the charge density in this layer to vary according to the law

$$\rho_x = -\rho_0\, \frac{y}{a} \qquad (49.3)$$

Fig. 49.2.

To obtain a density varying according to the law x/a, let us shift the cylinders as shown in Fig. 49.2b. Now the density of the charge in an imaginary surface layer of thickness b will vary according to the law

$$\rho_y = \rho_0\, \frac{x}{a} \qquad (49.4)$$

In the cases being considered, the fields are the superposition of
the two fields $+\varphi_0$ and $-\varphi_0$, identical in magnitude
but opposite in sign, displaced relative to each other over the very
small distance $\mathbf{b}$ (the latter is directed oppositely to the
$y$-axis for $\rho_x$, and along the $x$-axis for $\rho_y$). Assume
that initially the fields were accurately superposed, as a result of
which the potential at the point determined by the position vector
$\mathbf{r}$ was $[+\varphi_0(\mathbf{r})] + [-\varphi_0(\mathbf{r})]$,
i.e. zero. Now let us shift the field $+\varphi_0$ by $\mathbf{b}/2$,
and the field $-\varphi_0$ by $-\mathbf{b}/2$. Hence, the potential
$+\varphi_0$ that was at the point $(\mathbf{r} - \mathbf{b}/2)$ and
the potential $-\varphi_0$ that was at the point
$(\mathbf{r} + \mathbf{b}/2)$ will be at the point with the position
vector $\mathbf{r}$. Consequently,

$$\varphi(\mathbf{r}) = \varphi_0\left(\mathbf{r} - \frac{\mathbf{b}}{2}\right) - \varphi_0\left(\mathbf{r} + \frac{\mathbf{b}}{2}\right)$$

Owing to the smallness of $b$, both terms of the expression we have
obtained can be transformed by formula (XI.5), i.e. can be written as

$$\varphi(\mathbf{r}) = \left[\varphi_0(\mathbf{r}) - \nabla\varphi_0\cdot\frac{\mathbf{b}}{2}\right] - \left[\varphi_0(\mathbf{r}) + \nabla\varphi_0\cdot\frac{\mathbf{b}}{2}\right] = -\nabla\varphi_0\cdot\mathbf{b}$$

For $\rho_x$, the vector $\mathbf{b}$ has the components $b_x = b_z = 0$
and $b_y = -b$, so that

$$\varphi(\mathbf{r}) = \frac{\partial\varphi_0}{\partial y}\,b \quad (\text{for } A_x) \tag{49.5}$$

For $\rho_y$, we have $b_y = b_z = 0$ and $b_x = b$, so that

$$\varphi(\mathbf{r}) = -\frac{\partial\varphi_0}{\partial x}\,b \quad (\text{for } A_y) \tag{49.6}$$

It now remains to find $\varphi_0$ and introduce its derivatives into
(49.5) and (49.6). Recall that $\varphi_0$ is the potential of the field
produced by a cylinder of radius $a$ charged with the constant volume
density $+\rho_0$. The potential inside and outside such a cylinder is
determined by different formulas.

Field Inside a Solenoid. We can easily find with the aid of Gauss’s
theorem that the field inside a charged cylinder is
$E = 2\pi\rho_0 R$, where $R$ is the distance from the axis of the
cylinder ($R < a$). The potential
$\varphi_0 = -\pi\rho_0 R^2 + \text{const} = -\pi\rho_0 (x^2 + y^2) + \text{const}$
corresponds to this field. Its derivatives are

$$\frac{\partial\varphi_0}{\partial x} = -2\pi\rho_0 x, \qquad \frac{\partial\varphi_0}{\partial y} = -2\pi\rho_0 y$$

Let us introduce the found values of the derivatives into (49.5)
and (49.6):

$$\varphi(\mathbf{r}) = -2\pi\rho_0 b y \quad (\text{for } A_x)$$

$$\varphi(\mathbf{r}) = 2\pi\rho_0 b x \quad (\text{for } A_y)$$
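
The two values just obtained can be checked numerically by taking the
difference of the shifted potentials directly, without the expansion.
The following is a minimal sketch; the function names and the values
of $\rho_0$, $b$ and of the test point are illustrative assumptions,
not quantities from the text.

```python
import math

def phi0_inside(x, y, rho0):
    """Potential inside a uniformly charged cylinder (additive constant dropped)."""
    return -math.pi * rho0 * (x * x + y * y)

def shifted_pair(x, y, bx, by, rho0):
    """phi0(r - b/2) - phi0(r + b/2) for the displacement vector b = (bx, by)."""
    return (phi0_inside(x - bx / 2, y - by / 2, rho0)
            - phi0_inside(x + bx / 2, y + by / 2, rho0))

rho0, b = 1.0, 1e-3      # illustrative values (assumptions)
x, y = 0.3, 0.2          # a point inside the cylinder

phi_for_Ax = shifted_pair(x, y, 0.0, -b, rho0)   # b = (0, -b, 0), the case rho_x
phi_for_Ay = shifted_pair(x, y, b, 0.0, rho0)    # b = (b, 0, 0), the case rho_y

# Each line prints the directly computed value next to the closed-form one.
print(phi_for_Ax, -2 * math.pi * rho0 * b * y)
print(phi_for_Ay, 2 * math.pi * rho0 * b * x)
```

Since $\varphi_0$ is quadratic in the coordinates inside the cylinder,
the first-order expansion introduces no error there, and the two
columns agree to rounding accuracy.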

Substituting $ni/bc$ for $\rho_0$ in these expressions [see (49.2)],
we obtain expressions for $A_x$ and $A_y$ (it was established above
that $A_z = 0$):

$$A_x = -\frac{2\pi ni}{c}\,y = -\frac{1}{2}\left(\frac{4\pi ni}{c}\right) y, \qquad A_y = \frac{2\pi ni}{c}\,x = \frac{1}{2}\left(\frac{4\pi ni}{c}\right) x, \qquad A_z = 0 \tag{49.7}$$

It was shown in Sec. 47 [see (47.11)] that the potential (49.7)
determines a homogeneous magnetic field with the magnetic induction

$$B = \frac{4\pi}{c}\,ni \tag{49.8}$$

parallel to the $z$-axis, i.e. to the axis of the solenoid.

Field Outside a Solenoid. The potential outside a cylinder charged
with the density $\rho_0$ is

$$\varphi_0 = -2\rho_0\pi a^2 \ln\frac{R}{R_0}$$

where $R$ is the distance from the cylinder axis ($R > a$), and $R_0$
is a constant [compare with (48.8)]. The derivatives of $\varphi_0$
can be written as

$$\frac{\partial\varphi_0}{\partial x} = -2\rho_0\pi a^2\,\frac{x}{R^2}, \qquad \frac{\partial\varphi_0}{\partial y} = -2\rho_0\pi a^2\,\frac{y}{R^2}$$

Substitution of these values into (49.5) and (49.6) yields

$$\varphi(\mathbf{r}) = -2\rho_0\pi a^2 b\,\frac{y}{R^2} \quad (\text{for } A_x)$$

$$\varphi(\mathbf{r}) = 2\rho_0\pi a^2 b\,\frac{x}{R^2} \quad (\text{for } A_y)$$

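Substituting $ni/bc$ for $\rho_0$, we obtain $A_x$ and $A_y$. Let us
write the values of all three components:

$$A_x = -\frac{2\pi nia^2}{c}\,\frac{y}{R^2} = -\kappa\,\frac{y}{R^2}, \qquad A_y = \frac{2\pi nia^2}{c}\,\frac{x}{R^2} = \kappa\,\frac{x}{R^2}, \qquad A_z = 0 \tag{49.9}$$

(here $\kappa$ stands for the constant factor $2\pi nia^2/c$).

The quantity $R^2 = x^2 + y^2$ does not contain $z$. Consequently,
$\partial A_x/\partial z = \partial A_y/\partial z = 0$. Since
$A_z = 0$, it follows that
$\partial A_z/\partial x = \partial A_z/\partial y = 0$. Hence, $B_x$
and $B_y$ vanish. Let us find $B_z$:

$$B_z = \frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y} = \kappa\left[\left(\frac{1}{R^2} - \frac{2x}{R^3}\frac{x}{R}\right) - \left(-\frac{1}{R^2} + \frac{2y}{R^3}\frac{y}{R}\right)\right] = \kappa\left[\frac{2}{R^2} - \frac{2(x^2 + y^2)}{R^2 R^2}\right] = 0$$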


We have thus found that the field outside an infinitely long solenoid
is zero. The vector potential outside the solenoid, however, is
non-zero. The collection of formulas (49.9) can be written as a single
vector formula:

$$\mathbf{A} = \frac{2\pi nia^2}{c}\,\frac{[\mathbf{e}_z, \mathbf{R}]}{R^2} \tag{49.10}$$


A comparison with formula (48.11) shows that the field of the vector 
A outside the solenoid has the same nature as the field of the vector 
B around a straight long current-carrying conductor. 
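
The results of this section lend themselves to a simple numerical
check: the curl of the potential (49.7) should give the homogeneous
field (49.8) inside the solenoid, and the curl of (49.9) should vanish
outside it. The sketch below does this with central differences. The
function names and the numerical values of $n$, $i$ and $a$ are
illustrative assumptions.

```python
import math

def A_solenoid(x, y, n, i, a, c):
    """(A_x, A_y) from (49.7) for R < a and from (49.9) for R > a."""
    R2 = x * x + y * y
    k = 2 * math.pi * n * i / c
    if R2 < a * a:
        return (-k * y, k * x)
    return (-k * a * a * y / R2, k * a * a * x / R2)

def B_z(x, y, n, i, a, c, h=1e-5):
    """dA_y/dx - dA_x/dy by central differences.

    The point must lie farther than h from the surface R = a, so that
    all samples are taken on the same side of it.
    """
    dAy_dx = (A_solenoid(x + h, y, n, i, a, c)[1]
              - A_solenoid(x - h, y, n, i, a, c)[1]) / (2 * h)
    dAx_dy = (A_solenoid(x, y + h, n, i, a, c)[0]
              - A_solenoid(x, y - h, n, i, a, c)[0]) / (2 * h)
    return dAy_dx - dAx_dy

n, i, a, c = 10.0, 3.0e9, 1.0, 2.998e10   # illustrative Gaussian-unit values

print(B_z(0.3, 0.2, n, i, a, c), 4 * math.pi * n * i / c)  # inside: numeric vs (49.8)
print(B_z(2.0, 1.0, n, i, a, c))                           # outside: close to zero
```

At $R = a$ both branches give a potential of magnitude $2\pi nia/c$,
so the vector potential itself is continuous across the winding even
though the field changes abruptly there.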


50. The Biot-Savart Law 

The expression

$$\mathbf{A}(\mathbf{r}) = \frac{1}{c}\int\frac{\mathbf{j}(\mathbf{r}')\,dV'}{|\mathbf{r} - \mathbf{r}'|} \tag{50.1}$$


makes it possible, knowing the distribution of currents in space, to
determine the vector potential of a magnetic field [see (48.7)]. Let
us attempt to find a formula that would allow us to find $\mathbf{B}$
directly from the preset currents. For this purpose, let us calculate
the curl of the function (50.1). It must be borne in mind that
integration in (50.1) is performed over the primed coordinates
$x', y', z'$, whereas differentiation in taking the curl is performed
over the unprimed coordinates $x, y, z$. Therefore, we may exchange
the places of the operations of integration and taking the curl. With
this in view, we obtain

$$\mathbf{B}(\mathbf{r}) = [\nabla, \mathbf{A}(\mathbf{r})] = \frac{1}{c}\int\left[\nabla, \frac{\mathbf{j}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right] dV'$$

Considering $\mathbf{j}(\mathbf{r}')/|\mathbf{r} - \mathbf{r}'|$ as the
product of the vector function $\mathbf{j}(\mathbf{r}')$ and the scalar
function $1/|\mathbf{r} - \mathbf{r}'|$, we shall use formula (XI.27):

$$\left[\nabla, \frac{\mathbf{j}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right] = \left[\left(\nabla\frac{1}{|\mathbf{r} - \mathbf{r}'|}\right), \mathbf{j}(\mathbf{r}')\right] + \frac{1}{|\mathbf{r} - \mathbf{r}'|}\,[\nabla, \mathbf{j}(\mathbf{r}')]$$

The second term vanishes because $\mathbf{j}(\mathbf{r}')$ contains no
unprimed coordinates. The gradient in the first term is

$$\nabla\frac{1}{|\mathbf{r} - \mathbf{r}'|} = -\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3}$$






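We have thus arrived at the formula

$$\mathbf{B}(\mathbf{r}) = \frac{1}{c}\int\left[-\frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3}, \mathbf{j}(\mathbf{r}')\right] dV' = \frac{1}{c}\int\frac{[\mathbf{j}(\mathbf{r}'), (\mathbf{r} - \mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|^3}\,dV' \tag{50.2}$$

(we have put the scalar factor outside the sign of the vector product
and, in addition, have changed the places of the factors, which
resulted in vanishing of the minus sign).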

The formula obtained is a solution of the problem we posed: it
allows us to calculate $\mathbf{B}$ from the preset currents. Formula
(50.2) is simplified if the currents flow only through thin wires.
Figure 50.1 shows a portion of such a wire. Inspection of the figure
shows that the expression $\mathbf{j}\,dV'$ can be written as

$$\mathbf{j}\,dV' = \mathbf{j}\sigma\,dl = j\sigma\,d\mathbf{l} = i\,d\mathbf{l} \tag{50.3}$$

where $\sigma$ is the cross-sectional area of the wire, $i$ is the
current, and $d\mathbf{l}$ is a vector coinciding with the wire element
$dl$ and having the same direction as the current. Substituting
$i\,d\mathbf{l}$ for $\mathbf{j}\,dV'$ in (50.2), we obtain

$$\mathbf{B} = \frac{i}{c}\int\frac{[d\mathbf{l}, (\mathbf{r} - \mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|^3} \tag{50.4}$$

(integration is performed over the length of all the wires).

Formula (50.4) is an analytical expression of the Biot-Savart law. 
Figure 50.2 explains that r — r' is a vector drawn from the 
point where the current element dl is to the point for which B is 
being calculated. 
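
Formula (50.4) is easy to evaluate numerically. The sketch below sums
the contributions of short elements of a circular loop of radius $a$
lying in the plane $z = 0$ and compares the result on the axis with the
known closed-form value $2\pi i a^2 / c\,(a^2 + z^2)^{3/2}$, which is
quoted here as a standard result and is not derived in the text. The
function names and the number of elements are illustrative
assumptions.

```python
import math

def cross(u, v):
    """Vector product [u v] of two 3-component tuples."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def biot_savart_loop(point, a, i, c, segments=2000):
    """B at `point` from a circular loop of radius a in the plane z = 0, by (50.4)."""
    B = [0.0, 0.0, 0.0]
    dphi = 2 * math.pi / segments
    for s in range(segments):
        phi = (s + 0.5) * dphi
        r_src = (a * math.cos(phi), a * math.sin(phi), 0.0)            # r'
        dl = (-a * math.sin(phi) * dphi, a * math.cos(phi) * dphi, 0.0)
        d = tuple(p - q for p, q in zip(point, r_src))                 # r - r'
        dist3 = math.sqrt(sum(comp * comp for comp in d)) ** 3
        term = cross(dl, d)
        for k in range(3):
            B[k] += i / c * term[k] / dist3
    return tuple(B)

def on_axis_reference(z, a, i, c):
    """Closed-form B_z on the axis of the loop (Gaussian units)."""
    return 2 * math.pi * i * a * a / (c * (a * a + z * z) ** 1.5)

print(biot_savart_loop((0.0, 0.0, 1.5), 2.0, 5.0, 3.0))
print(on_axis_reference(1.5, 2.0, 5.0, 3.0))
```

The same function can be called for points off the axis, where no
elementary closed form is available; only the on-axis comparison is
used here as a check.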


51. Magnetic Moment

Before turning to the topic of the present section, let us obtain the 
continuity equation, which is a corollary of the law of charge con- 
servation. 

Assume that currents characterized by the density
$\mathbf{j} = \mathbf{j}(\mathbf{r})$ flow in a certain region. Let us
separate an imaginary volume $V$ confined by the surface $f$ in this
region. The charge flowing outward through this surface in unit time
can be written as

$$\oint_f j_n\,df = \int_V \nabla\mathbf{j}\,dV$$

The above expression equals the rate of diminishing of the charge
confined in the volume $V$, which is determined by the expression

$$-\int_V\frac{\partial\rho}{\partial t}\,dV$$

(we have written the partial derivative with respect to $t$ because
$\rho$ is a function not only of time, but also of the coordinates).

Equating the two expressions, we obtain

$$\int_V \nabla\mathbf{j}\,dV = -\int_V\frac{\partial\rho}{\partial t}\,dV$$

The above equation must be observed for any arbitrarily chosen
volume. This is possible only upon equality of the integrands at
every point of space. We thus arrive at the relation

$$\nabla\mathbf{j} = -\frac{\partial\rho}{\partial t} \tag{51.1}$$

known as the continuity equation in the differential form. In the
integral form, the continuity equation is

$$\oint j_n\,df = -\frac{\partial}{\partial t}\int\rho\,dV \tag{51.2}$$
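
The differential form (51.1) can be illustrated with a one-dimensional
example. Assume, purely for illustration, a charge cloud of density
$\rho = e^{-(x - vt)^2}$ moving rigidly with the velocity $v$ along the
$x$-axis, so that $j_x = \rho v$. The sketch below evaluates
$\partial j_x/\partial x + \partial\rho/\partial t$ by central
differences; the profile and the function names are assumptions made
for this example.

```python
import math

def rho(x, t, v):
    """Assumed charge density: a Gaussian cloud moving rigidly with velocity v."""
    return math.exp(-(x - v * t) ** 2)

def j_x(x, t, v):
    """Current density of the moving cloud, j = rho * v."""
    return v * rho(x, t, v)

def continuity_residual(x, t, v, h=1e-5):
    """dj_x/dx + d(rho)/dt by central differences; zero if (51.1) holds."""
    d_rho_dt = (rho(x, t + h, v) - rho(x, t - h, v)) / (2 * h)
    d_j_dx = (j_x(x + h, t, v) - j_x(x - h, t, v)) / (2 * h)
    return d_j_dx + d_rho_dt

print(continuity_residual(0.4, 0.1, 2.0))   # small compared with either term
```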

Now let us consider a system of stationary currents circulating 
within a restricted volume V and calculate the magnetic field set up 
by this system at distances that are great in comparison with the 
dimensions of the system. This problem is similar to that treated in 
Sec. 43. 

First of all, we must note that owing to the stationary nature of
the currents the accumulation or dissipation of charges cannot occur
at any of the system’s points, i.e. the condition
$\partial\rho/\partial t = 0$ must hold everywhere. It thus follows by
(51.1) that within the system

$$\nabla\mathbf{j} = 0 \tag{51.3}$$

Further, there are no currents outside the system. Hence, everywhere
on the surface confining the system, we have

$$j_n = 0 \tag{51.4}$$

(the currents do not intersect the surface).

Finally, we shall prove that

$$\int_V \mathbf{j}\,dV = 0 \tag{51.5}$$

where the integral is evaluated over the entire volume of the system.
Owing to the stationary and restricted nature of the system, all the
current tubes¹ that can be separated inside the system are closed.
Consequently, the entire volume of the system can be divided into
closed current tubes. For each of the tubes, the integral over its
volume can be transformed as follows:

$$\int_{\text{over tube}}\mathbf{j}\,dV = \oint\mathbf{j}\sigma\,dl = \oint j\sigma\,d\mathbf{l} = i\oint d\mathbf{l} = 0 \tag{51.6}$$

[$\sigma$ is the cross-sectional area of a tube, $i$ is the current,
and $d\mathbf{l}$ is an element of length of a tube, see formula
(50.3)]. Summation of expression (51.6) over all the current tubes
yields formula (51.5).

Let us choose the origin of coordinates inside the system and write
an expression for the vector potential:

$$\mathbf{A}(\mathbf{r}) = \frac{1}{c}\int\frac{\mathbf{j}(\mathbf{r}')\,dV'}{|\mathbf{r} - \mathbf{r}'|} \tag{51.7}$$

Here $\mathbf{r}$ is the position vector of the point for which
$\mathbf{A}$ is being calculated, and $\mathbf{r}'$ is the position
vector of the point in whose vicinity the elementary volume $dV'$ is;
integration is performed over the primed coordinates within the
confines of the system’s volume.

Taking advantage of the fact that according to our assumption
$r' \ll r$, let us expand expression (51.7) into a series² in $r'/r$.
To within first-order terms, we have

$$\mathbf{A}(\mathbf{r}) = \frac{1}{c}\int_V\frac{\mathbf{j}(\mathbf{r}')}{r}\,dV' - \frac{1}{c}\int_V\mathbf{j}(\mathbf{r}')\left(\mathbf{r}'\,\nabla\frac{1}{r}\right) dV' \tag{51.8}$$


¹ By a tube of an electric current is meant the same as by a tube of
flow in a fluid, i.e. the volume confined by lines whose tangents at
each point coincide with the direction of the vector $\mathbf{j}$.

² Considering $-\mathbf{r}'$ as a small increment of the position vector
$\mathbf{r}$, we write the function $f(\mathbf{r}) = 1/r$ for the value
of the argument $\mathbf{r} - \mathbf{r}'$ as

$$f(\mathbf{r} - \mathbf{r}') = f(\mathbf{r}) + \nabla f(\mathbf{r})\,(-\mathbf{r}')$$

The first term of (51.8), owing to (51.6), vanishes. Consequently,
substituting for $\nabla(1/r)$ its value $-\mathbf{r}/r^3$, we obtain

$$\mathbf{A}(\mathbf{r}) = \frac{1}{cr^3}\int\mathbf{j}(\mathbf{r}')\,(\mathbf{r}\mathbf{r}')\,dV'$$
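
The quality of the first-order expansion used above can be judged
numerically. The sketch below compares the exact value of
$1/|\mathbf{r} - \mathbf{r}'|$ with $1/r + (\mathbf{r}\mathbf{r}')/r^3$;
the discarded terms are of the second order in $r'/r$, so halving
$\mathbf{r}'$ should reduce the error roughly fourfold. The function
names and the sample vectors are illustrative assumptions.

```python
import math

def inverse_distance(r, rp):
    """Exact 1/|r - r'|."""
    return 1.0 / math.dist(r, rp)

def inverse_distance_first_order(r, rp):
    """1/r + (r r')/r^3, the expansion to first order in r'/r."""
    r_len = math.hypot(*r)
    dot = sum(a * b for a, b in zip(r, rp))
    return 1.0 / r_len + dot / r_len ** 3

def truncation_error(r, rp):
    """Absolute difference between the exact value and the expansion."""
    return abs(inverse_distance(r, rp) - inverse_distance_first_order(r, rp))

r = (3.0, 4.0, 12.0)                      # |r| = 13
rp = (0.1, -0.2, 0.05)                    # |r'| much smaller than |r|
rp_half = tuple(0.5 * comp for comp in rp)

print(truncation_error(r, rp), truncation_error(r, rp_half))
```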


Let us write the $k$-th component of the potential:

$$A_k = \frac{1}{cr^3}\int j_k\left(\sum_i x_i x'_i\right) dV' = \frac{1}{cr^3}\sum_i x_i\int j_k x'_i\,dV' \tag{51.9}$$

(we have expressed the scalar product $\mathbf{r}\mathbf{r}'$ through
the components of the position vectors). The products $j_k x'_i$ are
the components of a tensor of rank two¹. Let us write this tensor as
the sum of a symmetric and an antisymmetric tensor:

$$j_k x'_i = \frac{j_k x'_i + j_i x'_k}{2} + \frac{j_k x'_i - j_i x'_k}{2} = S_{ki} + A_{ki} \tag{51.10}$$

[see (X.27)].

Let us prove that the integral of the symmetric component of the
tensor (51.10) is zero. For this purpose, we shall use the identity

$$\nabla'(x'_i x'_k\,\mathbf{j}) = \mathbf{j}\,\nabla'(x'_i x'_k) + x'_i x'_k\,\nabla'\mathbf{j} \tag{51.11}$$

[see (XI.26)]. In view of (51.3), the second term on the right-hand
side vanishes. Let us write the expression $\nabla'(x'_i x'_k)$ using
the formula for the gradient of the product of scalar functions [see
(XI.25)]:

$$\nabla'(x'_i x'_k) = x'_i\,\nabla' x'_k + x'_k\,\nabla' x'_i = x'_i\sum_m\mathbf{e}_m\frac{\partial x'_k}{\partial x'_m} + x'_k\sum_m\mathbf{e}_m\frac{\partial x'_i}{\partial x'_m} = \sum_m\mathbf{e}_m\,(x'_i\delta_{km} + x'_k\delta_{im})$$

(we have taken into account that
$\partial x'_k/\partial x'_m = \delta_{km}$). The expression in
parentheses is the $m$-th component of the gradient of the function
$x'_i x'_k$.

Now let us calculate the first term on the right-hand side of
expression (51.11):

$$\mathbf{j}\,\nabla'(x'_i x'_k) = \sum_m j_m\{\nabla'(x'_i x'_k)\}_m = \sum_m j_m\,(x'_i\delta_{km} + x'_k\delta_{im}) = \sum_m j_m x'_i\delta_{km} + \sum_m j_m x'_k\delta_{im}$$

In the first sum on the right-hand side, only the addend with
$m = k$ is non-zero, and in the second, the addend with $m = i$. Hence

$$\mathbf{j}\,\nabla'(x'_i x'_k) = j_k x'_i + j_i x'_k$$


¹ The integral $J_{ki} = \int j_k x'_i\,dV'$ is also a component of a
tensor. Consequently, the vector $\mathbf{A}$ to within the factor
$1/cr^3$ is the product of the tensor $J_{ki}$ and the vector $x_i$
[see (X.22)].

Therefore, by (51.11)

$$S_{ki} = \frac{j_k x'_i + j_i x'_k}{2} = \frac{1}{2}\,\nabla'(x'_i x'_k\,\mathbf{j})$$


Let us take an integral of this expression:

$$\int_V S_{ki}\,dV' = \frac{1}{2}\int_V\nabla'(x'_i x'_k\,\mathbf{j})\,dV' = \frac{1}{2}\oint x'_i x'_k\,j_n\,df$$

(we have employed the Ostrogradsky-Gauss theorem). In view of
(51.4), the last integral vanishes. We have thus proved that the
integral of the symmetric component of the integrand on the
right-hand side in (51.9) vanishes. This allows us to write expression
(51.9) as

$$A_k = \frac{1}{2cr^3}\int_V\left\{\sum_i x_i\,(j_k x'_i - j_i x'_k)\right\} dV' \tag{51.12}$$

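We shall show that the integrand equals the $k$-th component of
the triple vector product $[\mathbf{r}, [\mathbf{j}\mathbf{r}']]$.
Introducing the auxiliary notation
$\mathbf{b} = [\mathbf{j}\mathbf{r}']$ and using formula (VI.33), we
can write

$$[\mathbf{r}, [\mathbf{j}\mathbf{r}']]_k = [\mathbf{r}\mathbf{b}]_k = \sum_{i,l}\varepsilon_{kil}\,x_i b_l = \sum_{i,l}\varepsilon_{kil}\,x_i\sum_{m,n}\varepsilon_{lmn}\,j_m x'_n = \sum_{i,l,m,n}\varepsilon_{lki}\,\varepsilon_{lmn}\,x_i j_m x'_n$$

(we remind our reader that in a cyclic rearrangement of the subscripts
the value of $\varepsilon_{lmn}$ does not change). We summate over the
subscript $l$ using relation (VI.16):

$$[\mathbf{r}, [\mathbf{j}\mathbf{r}']]_k = \sum_{i,m,n}\delta_{km}\delta_{in}\,x_i j_m x'_n - \sum_{i,m,n}\delta_{kn}\delta_{im}\,x_i j_m x'_n$$

Now let us summate over the subscripts $m$ and $n$. In the first sum,
the addends with $m = k$ and $n = i$ are non-zero, and in the second,
the addends with $n = k$ and $m = i$. Hence,

$$[\mathbf{r}, [\mathbf{j}\mathbf{r}']]_k = \sum_i x_i j_k x'_i - \sum_i x_i j_i x'_k = \sum_i x_i\,(j_k x'_i - j_i x'_k)$$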
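
Both steps of this index calculation can be verified by brute force.
The sketch below checks the contraction
$\sum_l \varepsilon_{lki}\varepsilon_{lmn} = \delta_{km}\delta_{in} - \delta_{kn}\delta_{im}$
for all index values and then compares the components of
$[\mathbf{r}, [\mathbf{j}\mathbf{r}']]$ with the integrand of (51.12)
for one set of integer vectors. Indices run over 0, 1, 2 instead of
1, 2, 3; the helper names and the sample vectors are illustrative
assumptions.

```python
def eps(i, j, k):
    """Levi-Civita symbol for indices taken from 0, 1, 2."""
    return (i - j) * (j - k) * (k - i) // 2

def delta(i, j):
    """Kronecker symbol."""
    return 1 if i == j else 0

def contraction_identity_holds():
    """Check sum_l eps_lki eps_lmn = delta_km delta_in - delta_kn delta_im."""
    idx = range(3)
    return all(
        sum(eps(l, k, i) * eps(l, m, n) for l in idx)
        == delta(k, m) * delta(i, n) - delta(k, n) * delta(i, m)
        for k in idx for i in idx for m in idx for n in idx
    )

def cross(u, v):
    """Vector product [u v]."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def integrand(r, j, rp):
    """Components sum_i x_i (j_k x'_i - j_i x'_k), k = 0, 1, 2."""
    return tuple(
        sum(r[i] * (j[k] * rp[i] - j[i] * rp[k]) for i in range(3))
        for k in range(3)
    )

r, j, rp = (1, 2, 3), (4, -5, 6), (-7, 8, 9)
print(contraction_identity_holds())
print(cross(r, cross(j, rp)), integrand(r, j, rp))
```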

A comparison with (51.12) allows us to write

$$A_k = \frac{1}{2cr^3}\int_V [\mathbf{r}, [\mathbf{j}\mathbf{r}']]_k\,dV'$$

or in the vector form

$$\mathbf{A} = \frac{1}{2cr^3}\int_V [[\mathbf{r}'\mathbf{j}], \mathbf{r}]\,dV' = \frac{1}{r^3}\left[\left\{\frac{1}{2c}\int_V [\mathbf{r}'\mathbf{j}]\,dV'\right\}, \mathbf{r}\right] \tag{51.13}$$

(we have exchanged the places of the factors in both vector products).

The quantity

$$\mathbf{m} = \frac{1}{2c}\int_V [\mathbf{r}'\mathbf{j}]\,dV \tag{51.14}$$

is known as the magnetic moment of a system¹.

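For a system of discrete charges, the expression of the magnetic
moment is

$$\mathbf{m} = \frac{1}{2c}\sum_a e_a\,[\mathbf{r}_a\mathbf{v}_a] \tag{51.15}$$

The latter is obtained from (51.14) if we assume that

$$\mathbf{j}(\mathbf{r}') = \rho(\mathbf{r}')\,\mathbf{v}(\mathbf{r}') = \sum_a e_a\,\mathbf{v}(\mathbf{r}')\,\delta(\mathbf{r}' - \mathbf{r}_a)$$

[see (41.12)] and have in mind that
$\int e_a\,\mathbf{v}(\mathbf{r}')\,\delta(\mathbf{r}' - \mathbf{r}_a)\,dV'$
(taken in the close proximity of the point $\mathbf{r}_a$) transforms
into $e_a\mathbf{v}_a$.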

The magnetic moment depends only on the properties of a system
and, as can readily be seen, does not depend on the choice of the
origin of the coordinate system. Indeed, let us displace the origin of
the coordinate system over the distance $\mathbf{b}$. The position
vectors in the new system will now be
$\mathbf{r}'' = \mathbf{r}' - \mathbf{b}$. The magnetic moment in the
new system is

$$\mathbf{m}' = \frac{1}{2c}\int_V [(\mathbf{r}' - \mathbf{b}), \mathbf{j}]\,dV = \frac{1}{2c}\int_V [\mathbf{r}'\mathbf{j}]\,dV - \frac{1}{2c}\int_V [\mathbf{b}\mathbf{j}]\,dV$$

The first term equals $\mathbf{m}$, and the second one can be written as

$$-\frac{1}{2c}\left[\mathbf{b}, \int_V\mathbf{j}\,dV\right]$$

But according to (51.5), $\int\mathbf{j}\,dV$ vanishes. Consequently,
we have arrived at equality of the moments $\mathbf{m}'$ and
$\mathbf{m}$, Q.E.D.

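With a view to (51.14), expression (51.13) can be written as follows:

$$\mathbf{A}(\mathbf{r}) = \frac{[\mathbf{m}, \mathbf{r}]}{r^3} = -\left[\mathbf{m}, \nabla\frac{1}{r}\right] \tag{51.16}$$

[compare with formula (43.7)].

To find $\mathbf{B}$, we must calculate the curl of expression (51.16).
Assuming in formula (IX.29) that $\mathbf{a} = \mathbf{m}$ and
$\mathbf{b} = \mathbf{r}/r^3$, we obtain

$$\mathbf{B} = [\nabla\mathbf{A}] = \left[\nabla, \left[\mathbf{m}, \frac{\mathbf{r}}{r^3}\right]\right] = \left(\frac{\mathbf{r}}{r^3}\,\nabla\right)\mathbf{m} - (\mathbf{m}\nabla)\,\frac{\mathbf{r}}{r^3} + \mathbf{m}\left(\nabla\frac{\mathbf{r}}{r^3}\right) - \frac{\mathbf{r}}{r^3}\,(\nabla\mathbf{m})$$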

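¹ We have omitted the prime on $dV$ because the integrand contains no
unprimed coordinates. In the expressions containing both $\mathbf{r}'$
and $\mathbf{r}$, the prime on $dV$ indicated that integration is
performed over the primed coordinates. In the following we shall also
omit the prime on $\mathbf{r}$ in this expression.

The vector $\mathbf{m}$ does not depend on $\mathbf{r}$, therefore the
first and last terms on the right-hand side vanish. According to (XI.26)

$$\nabla\frac{\mathbf{r}}{r^3} = \mathbf{r}\,\nabla\frac{1}{r^3} + \frac{1}{r^3}\,\nabla\mathbf{r} = \mathbf{r}\left(-\frac{3\mathbf{r}}{r^5}\right) + \frac{3}{r^3} = 0$$

Therefore, the third term also vanishes. Hence,

$$\mathbf{B} = -(\mathbf{m}\nabla)\,\frac{\mathbf{r}}{r^3} \tag{51.17}$$

By formula (XI.33), we have

$$(\mathbf{m}\nabla)\,\frac{\mathbf{r}}{r^3} = \mathbf{r}\left(\mathbf{m}\,\nabla\frac{1}{r^3}\right) + \frac{1}{r^3}\,(\mathbf{m}\nabla)\,\mathbf{r} = -\mathbf{r}\left(\mathbf{m}\,\frac{3\mathbf{r}}{r^5}\right) + \frac{\mathbf{m}}{r^3} = \frac{\mathbf{m} - 3\mathbf{e}_r(\mathbf{m}\mathbf{e}_r)}{r^3}$$

[see (XI.34); $\mathbf{e}_r$ is the unit vector of the vector
$\mathbf{r}$]. Consequently,

$$\mathbf{B} = \frac{3\mathbf{e}_r(\mathbf{m}\mathbf{e}_r) - \mathbf{m}}{r^3} \tag{51.18}$$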

A comparison with formula (43.10) shows that a magnetic field is 
expressed in terms of the magnetic moment by a formula like the 
one expressing an electric field in terms 
of the electrical dipole moment. 
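
Formula (51.18) can be compared with a direct numerical
differentiation of the potential (51.16). The sketch below takes the
curl of $[\mathbf{m}, \mathbf{r}]/r^3$ by central differences and sets
it against $\{3\mathbf{e}_r(\mathbf{m}\mathbf{e}_r) - \mathbf{m}\}/r^3$.
The function names, the step $h$ and the sample vectors are
illustrative assumptions.

```python
import math

def cross(u, v):
    """Vector product [u v]."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def vector_potential(m, r):
    """A = [m, r] / r^3, formula (51.16)."""
    r_len = math.hypot(*r)
    return tuple(comp / r_len ** 3 for comp in cross(m, r))

def curl(field, r, h=1e-4):
    """Curl of a vector field at the point r by central differences."""
    def d(comp, axis):
        plus, minus = list(r), list(r)
        plus[axis] += h
        minus[axis] -= h
        return (field(tuple(plus))[comp] - field(tuple(minus))[comp]) / (2 * h)
    return (d(2, 1) - d(1, 2), d(0, 2) - d(2, 0), d(1, 0) - d(0, 1))

def dipole_field(m, r):
    """B = (3 e_r (m e_r) - m) / r^3, formula (51.18)."""
    r_len = math.hypot(*r)
    e = tuple(comp / r_len for comp in r)
    me = sum(a * b for a, b in zip(m, e))
    return tuple((3 * e[k] * me - m[k]) / r_len ** 3 for k in range(3))

m = (0.3, -0.4, 1.0)
r = (1.0, 2.0, 2.0)
print(curl(lambda p: vector_potential(m, p), r))
print(dipole_field(m, r))
```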

In concluding, let us calculate the magnetic moment of a current
flowing through a thin wire that forms a plane loop. We choose the
origin of coordinates in the plane of the loop (Fig. 51.1).
According to the definition (51.14)

$$\mathbf{m} = \frac{1}{2c}\int [\mathbf{r}\mathbf{j}]\,dV$$

In the case being considered, we can make the substitution
$\mathbf{j}\,dV = i\,d\mathbf{l}$ [see formulas (50.3) and (51.6)].
Consequently, the expression for $\mathbf{m}$ can be written as

$$\mathbf{m} = \frac{i}{2c}\oint [\mathbf{r}, d\mathbf{l}]$$

Designating the unit vector of a normal to the plane of the loop by
the symbol $\mathbf{n}$ (in Fig. 51.1 this unit vector is directed
beyond the drawing), the integrand can be written as
$\mathbf{n}\,r\sin\alpha\,dl$ so that

$$\mathbf{m} = \frac{i\mathbf{n}}{c}\oint\frac{r\sin\alpha\,dl}{2}$$

A glance at Fig. 51.1 shows that the integrand equals the area of
the hatched triangle. The integral therefore equals the area $f$ of the
loop. We have thus arrived at an expression for the magnetic moment:

$$\mathbf{m} = \frac{1}{c}\,if\,\mathbf{n}$$
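
Both properties of the magnetic moment established in this section,
its independence of the origin and its value for a plane loop, can be
checked on a polygonal wire loop. For a straight segment the integral
of $[\mathbf{r}, d\mathbf{l}]$ equals the vector product of the
midpoint position vector and the segment vector, which is what the
sketch below uses. The function names, the rectangle and the values of
$i$ and $c$ are illustrative assumptions.

```python
def cross(u, v):
    """Vector product [u v]."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def loop_moment(vertices, i, c, origin=(0.0, 0.0, 0.0)):
    """m = (i / 2c) * sum over segments of [r, dl] for a closed polygonal loop."""
    m = [0.0, 0.0, 0.0]
    count = len(vertices)
    for s in range(count):
        p, q = vertices[s], vertices[(s + 1) % count]
        dl = tuple(qk - pk for pk, qk in zip(p, q))
        # Position vector of the segment midpoint, measured from `origin`.
        r_mid = tuple((pk + qk) / 2 - ok for pk, qk, ok in zip(p, q, origin))
        term = cross(r_mid, dl)
        for k in range(3):
            m[k] += i / (2 * c) * term[k]
    return tuple(m)

# A 3 x 2 rectangle in the plane z = 0, traversed counterclockwise (area f = 6).
rectangle = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (3.0, 2.0, 0.0), (0.0, 2.0, 0.0)]
i, c = 2.0, 4.0   # illustrative values

print(loop_moment(rectangle, i, c))                           # expected (0, 0, i f / c)
print(loop_moment(rectangle, i, c, origin=(5.0, -7.0, 3.0)))  # same vector
```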


52. Field in Magnetics

In the presence of magnetics, the field produced by the molecules,
$\mathbf{B}_{\text{mol}}$, is superposed on the field of the conduction
currents $\mathbf{B}_j$ so that the resultant microscopic magnetic
field can be written as

$$\mathbf{B}_{\text{res}} = \mathbf{B}_j + \mathbf{B}_{\text{mol}} \tag{52.1}$$

The field $\mathbf{B}_{\text{res}}$, like the field
$\mathbf{E}_{\text{res}}$ determined by formula (44.1), is a
fast-varying function of time. In addition, it varies greatly in the
space between two adjacent molecules. This is why the macroscopic
quantity

$$\mathbf{B} = \mathbf{B}_j + \langle\mathbf{B}_{\text{mol}}\rangle \tag{52.2}$$

is taken as a characteristic of the magnetic field in magnetics. It is
called the magnetic induction in the magnetic. The microscopic
field $\mathbf{B}_{\text{mol}}$ is averaged in the same way as the field
$\mathbf{E}_{\text{bound}}$ (see Sec. 44).

It is customary practice to characterize the state of a magnetic
by the magnetic moment of a unit volume of the magnetic, known as
the magnetization. We shall denote the magnetization (which is
a vector) by the symbol $\mathbf{M}$. It is obvious that $\mathbf{M}$
can be determined as

$$\mathbf{M} = \frac{\sum\mathbf{m}_i}{dV} \tag{52.3}$$

where $dV$ is an infinitely small volume, and $\mathbf{m}_i$ is the
magnetic moment of an individual molecule; summation is performed over
all the molecules contained in the volume $dV$.

The contribution made by a magnetic to a macroscopic magnetic
field can be calculated by formula (51.16). According to (52.3), the
volume element $dV'$ of a magnetic has the magnetic moment
$d\mathbf{m} = \mathbf{M}\,dV'$. Consequently, at the point determined
by the position vector $\mathbf{r}$, this volume element produces the
magnetic potential

$$d\mathbf{A}(\mathbf{r}) = \frac{[\mathbf{M}(\mathbf{r}'), (\mathbf{r} - \mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|^3}\,dV' \tag{52.4}$$

[$\mathbf{r}'$ is the position vector of the point where the volume
element $dV'$ is, and $\mathbf{M}(\mathbf{r}')$ is the magnetization at
this point].

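The integral of expression (52.4) evaluated over the entire volume
of a magnetic gives the contribution made by the magnetic to the
macroscopic magnetic potential. It must be added to the magnetic
potential produced by the conduction currents [see formula (48.7)].
Consequently, in the presence of a magnetic, the field is
characterized by the potential

$$\mathbf{A}(\mathbf{r}) = \frac{1}{c}\int\frac{\mathbf{j}(\mathbf{r}')\,dV'}{|\mathbf{r} - \mathbf{r}'|} + \int\frac{[\mathbf{M}(\mathbf{r}'), (\mathbf{r} - \mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|^3}\,dV' = I_1 + I_2 \tag{52.5}$$

The second integral in this expression can be transformed as follows:

$$I_2 = \int\left[\mathbf{M}(\mathbf{r}'), \nabla'\left(\frac{1}{|\mathbf{r} - \mathbf{r}'|}\right)\right] dV'$$

(the prime on $\nabla$ signifies that when taking the gradient,
differentiation is performed over the primed coordinates).

Assuming in formula (XI.27) that
$\varphi = 1/|\mathbf{r} - \mathbf{r}'|$ and
$\mathbf{a} = \mathbf{M}(\mathbf{r}')$, we obtain

$$\left[\nabla', \frac{\mathbf{M}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right] = \left[\nabla'\left(\frac{1}{|\mathbf{r} - \mathbf{r}'|}\right), \mathbf{M}(\mathbf{r}')\right] + \frac{1}{|\mathbf{r} - \mathbf{r}'|}\,[\nabla', \mathbf{M}(\mathbf{r}')]$$

whence

$$\left[\mathbf{M}(\mathbf{r}'), \nabla'\left(\frac{1}{|\mathbf{r} - \mathbf{r}'|}\right)\right] = \frac{[\nabla', \mathbf{M}(\mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|} - \left[\nabla', \frac{\mathbf{M}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right]$$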

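Consequently, the second term in formula (52.5) can be written as

$$I_2 = \int\frac{[\nabla', \mathbf{M}(\mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|}\,dV' - \int\left[\nabla', \frac{\mathbf{M}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right] dV' = I_2' + I_2''$$

Let us transform the integral $I_2''$ by formula (XI.60):

$$I_2'' = -\int\left[\nabla', \frac{\mathbf{M}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\right] dV' = -\oint\frac{[\mathbf{n}, \mathbf{M}(\mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|}\,df'$$

If a magnetic occupies a finite volume or $\mathbf{M}(\mathbf{r}')$
diminishes sufficiently rapidly with an increasing distance from the
origin of coordinates, the last integral vanishes [when a magnetic is
localized in a finite volume, the integration surface can be chosen
outside the magnetic, and in this case $\mathbf{M}(\mathbf{r}') = 0$
everywhere on the surface].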

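Hence, in the presence of magnetics, the vector potential is
determined by the expression

$$\mathbf{A}(\mathbf{r}) = I_1 + I_2' = \frac{1}{c}\int\frac{\mathbf{j}(\mathbf{r}')\,dV'}{|\mathbf{r} - \mathbf{r}'|} + \int\frac{[\nabla', \mathbf{M}(\mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|}\,dV' = \frac{1}{c}\int\frac{\mathbf{j}(\mathbf{r}') + c\,[\nabla', \mathbf{M}(\mathbf{r}')]}{|\mathbf{r} - \mathbf{r}'|}\,dV' \tag{52.6}$$

The result we have obtained may be interpreted to signify that
the magnetization makes the same contribution to the vector potential
as the current of density

$$\mathbf{j}_M = c\,[\nabla\mathbf{M}] \tag{52.7}$$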

(we have omitted the prime on $\nabla$, and accordingly $\mathbf{M}$ is
considered as a function of $\mathbf{r}$ instead of $\mathbf{r}'$). It
follows that in the presence of magnetics, Eq. (48.3) must be written as

$$[\nabla\mathbf{B}] = \frac{4\pi}{c}\,\mathbf{j} + 4\pi\,[\nabla\mathbf{M}] \tag{52.8}$$

Combining the terms containing a curl, we obtain

$$[\nabla, (\mathbf{B} - 4\pi\mathbf{M})] = \frac{4\pi}{c}\,\mathbf{j}$$

The quantity

$$\mathbf{H} = \mathbf{B} - 4\pi\mathbf{M} \tag{52.9}$$

is called the strength of a magnetic field. It is an auxiliary
macroscopic characteristic of a magnetic field similar to the electric
displacement $\mathbf{D}$ [see (45.2)]. The following equation holds
for $\mathbf{H}$:

$$[\nabla\mathbf{H}] = \frac{4\pi}{c}\,\mathbf{j} \tag{52.10}$$

Experiments show that for diamagnetics and paramagnetics the
magnetization is proportional to the field strength:

$$\mathbf{M} = \chi_m\mathbf{H} \tag{52.11}$$

(The magnetic is assumed to be isotropic. In addition, we have in
mind fields for which the magnetization is far from saturation.)

The quantity $\chi_m$ is known as the magnetic susceptibility of a
substance. It is positive for paramagnetics and negative for
diamagnetics. Substitution of expression (52.11) into formula (52.9)
yields

$$\mathbf{B} = \mathbf{H} + 4\pi\chi_m\mathbf{H} = \mu\mathbf{H} \tag{52.12}$$

where

$$\mu = 1 + 4\pi\chi_m \tag{52.13}$$

is the permeability of a substance.

Relations (52.11)-(52.13) are also used for ferromagnetics, treating
$\chi_m$ and $\mu$ as functions of the field strength $H$.

The expediency of introducing $\mathbf{H}$ is due to the same
considerations that were set out in Sec. 45 to substantiate the
expediency of introducing $\mathbf{D}$.

Consider a field in a homogeneous isotropic magnetic. Assume that
in the absence of the magnetic the given conduction currents produce
a field characterized by the induction $\mathbf{B}_0$ and the potential
$\mathbf{A}_0$. It is known that $\mathbf{B}_0$ and $\mathbf{A}_0$ are
solutions of the equations

$$[\nabla\mathbf{B}_0] = \frac{4\pi}{c}\,\mathbf{j} \tag{52.14}$$

$$\Delta\mathbf{A}_0 = -\frac{4\pi}{c}\,\mathbf{j} \tag{52.15}$$

[see (48.3) and (48.4)].

Now, without changing the conduction currents, let us fill the
entire space in which the field differs from zero with a homogeneous
($\mu = \text{const}$) isotropic magnetic. The field induction will
therefore become equal to $\mathbf{B}$, and the potential will become
$\mathbf{A}$. Let us write equations for $\mathbf{B}$ and $\mathbf{A}$.
By Eq. (52.10), we have

$$[\nabla\mathbf{H}] = \left[\nabla, \frac{\mathbf{B}}{\mu}\right] = \frac{4\pi}{c}\,\mathbf{j} \tag{52.16}$$

Substituting $[\nabla\mathbf{A}]$ for $\mathbf{B}$ in this equation [see
(47.5)] and taking into account that $\mu = \text{const}$, we can write

$$\frac{1}{\mu}\,[\nabla, [\nabla\mathbf{A}]] = \frac{4\pi}{c}\,\mathbf{j}$$

Let us develop the left-hand side of the expression obtained by
formula (XI.45). The result is

$$\frac{1}{\mu}\,\{\nabla(\nabla\mathbf{A}) - \Delta\mathbf{A}\} = \frac{4\pi}{c}\,\mathbf{j}$$

But we have agreed to choose $\mathbf{A}$ so that $\nabla\mathbf{A}$
vanishes [see (47.7)]. Hence, the first term in braces vanishes, and we
arrive at the equation

$$\Delta\frac{\mathbf{A}}{\mu} = -\frac{4\pi}{c}\,\mathbf{j} \tag{52.17}$$

A comparison of (52.16) with (52.14) and (52.17) with (52.15)
shows that the equation for $\mathbf{B}/\mu$ coincides with that for
$\mathbf{B}_0$, and the equation for $\mathbf{A}/\mu$ with that for
$\mathbf{A}_0$. This shows that the filling of a space containing a
field with a homogeneous magnetic increases both the magnetic
induction and the magnetic potential $\mu$-fold.

By Eq. (52.17), Poisson’s equation (48.4) for a field in a
homogeneous isotropic magnetic has the following form:

$$\Delta\mathbf{A} = -\frac{4\pi\mu}{c}\,\mathbf{j} \tag{52.18}$$

When a homogeneous isotropic magnetic with $\mu = \text{const}$ (i.e.
with $\mu$ not depending on $H$) fills the entire space in which the
field differs from zero, the following relation holds:

$$\mathbf{H} = \frac{1}{\mu}\,[\nabla\mathbf{A}] = [\nabla\mathbf{A}'] \tag{52.19}$$

where $\mathbf{A}' = \mathbf{A}/\mu$.

We shall underline the fact that the field strength $\mathbf{H}$ can be
represented in the form of the curl of the function $\mathbf{A}/\mu$
only when the magnetic is homogeneous and $\mu = \text{const}$. The
magnetic induction $\mathbf{B}$, however, can always be written as
$\mathbf{B} = [\nabla\mathbf{A}]$ because $\nabla\mathbf{B} = 0$ in any
conditions.

Recall that at the boundary between two magnetics, the vectors
$\mathbf{B}$ and $\mathbf{H}$ must satisfy the following conditions:

$$B_{1n} = B_{2n}, \quad \mu_1 H_{1n} = \mu_2 H_{2n}; \qquad H_{1\tau} = H_{2\tau}, \quad \frac{B_{1\tau}}{\mu_1} = \frac{B_{2\tau}}{\mu_2} \tag{52.20}$$

A derivation of these boundary conditions can be found in textbooks
of general physics.

In anisotropic magnetics, the relation between $\mathbf{M}$ and
$\mathbf{H}$ is given by the expressions

$$M_i = \sum_{k=1}^{3}\chi_{m,ik}\,H_k \quad (i = 1, 2, 3) \tag{52.21}$$

where $\chi_{m,ik}$ is a symmetric tensor of rank two called the
magnetic susceptibility tensor.

The equations relating the vectors $\mathbf{B}$ and $\mathbf{H}$ can
accordingly be written as follows:

$$B_i = \sum_{k=1}^{3}\mu_{ik}\,H_k \quad (i = 1, 2, 3) \tag{52.22}$$

where the quantities

$$\mu_{ik} = \delta_{ik} + 4\pi\chi_{m,ik} \tag{52.23}$$

are the components of the permeability tensor (compare
with the formulas in Sec. 46).
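
A small numerical example may help to see what (52.21)-(52.23) imply:
in an anisotropic magnetic the vectors $\mathbf{B}$ and $\mathbf{H}$
are, in general, not parallel. In the sketch below the susceptibility
tensor is a hypothetical symmetric matrix chosen only for illustration
and does not describe any real substance; the function names are
likewise assumptions.

```python
import math

def permeability_tensor(chi):
    """mu_ik = delta_ik + 4*pi*chi_ik, formula (52.23)."""
    return [[(1.0 if i == k else 0.0) + 4 * math.pi * chi[i][k]
             for k in range(3)] for i in range(3)]

def contract(tensor, vec):
    """Components sum_k T_ik v_k."""
    return tuple(sum(tensor[i][k] * vec[k] for k in range(3)) for i in range(3))

# Hypothetical symmetric susceptibility tensor (illustrative numbers only).
chi = [[2.0e-5, 0.5e-5, 0.0],
       [0.5e-5, 1.0e-5, 0.0],
       [0.0,    0.0,    3.0e-5]]
H = (1.0, 0.0, 0.0)

M = contract(chi, H)                        # (52.21)
B = contract(permeability_tensor(chi), H)   # (52.22)

print(M)
print(B)   # B acquires a y-component although H is directed along x
```

For an isotropic magnetic the tensor reduces to
$\chi_{m,ik} = \chi_m\delta_{ik}$, and the same code reproduces
$\mathbf{B} = \mu\mathbf{H}$ with $\mu = 1 + 4\pi\chi_m$.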


53. Law of Electromagnetic Induction


A change in the magnetic induction flux through a closed contour
$\Gamma$ is attended by the appearance of an induced e.m.f. in the
circuit equal to

$$\mathcal{E}_i = -\frac{1}{c}\frac{d\Phi}{dt} = -\frac{1}{c}\frac{d}{dt}\int\mathbf{B}\,d\mathbf{f}$$

($\Phi$ is the magnetic flux penetrating the contour). If the surface
over which the integral is evaluated does not vary with time, the
operations of differentiation with respect to $t$ and integration over
the coordinates may exchange their places. Hence, the expression for
$\mathcal{E}_i$ can be written as

$$\mathcal{E}_i = -\frac{1}{c}\int\frac{\partial\mathbf{B}}{\partial t}\,d\mathbf{f} \tag{53.1}$$

(we have written the partial derivative with respect to $t$ because
$\mathbf{B}$, generally speaking, is a function of both time and the
coordinates).

The e.m.f. by definition is the circulation around a given contour
of the field strength vector $\mathbf{E}_{\text{ext}}$ of extraneous
forces. In the present case, the strength $\mathbf{E}$ of a vortical
electric field produced by the varying magnetic field $\mathbf{B}$ is
the strength $\mathbf{E}_{\text{ext}}$. Consequently,

$$\mathcal{E}_i = \oint_\Gamma\mathbf{E}\,d\mathbf{l} = \int [\nabla\mathbf{E}]\,d\mathbf{f} \tag{53.2}$$

(we have used Stokes’s theorem).

Equating the right-hand sides of expressions (53.1) and (53.2),
we arrive at the relation

$$\int [\nabla\mathbf{E}]\,d\mathbf{f} = -\frac{1}{c}\int\frac{\partial\mathbf{B}}{\partial t}\,d\mathbf{f} \tag{53.3}$$

Assume that both integrals are taken over the same surface [Eq.
(53.3) is also observed for different surfaces provided they rest on
the same contour $\Gamma$]. Relation (53.3) must be satisfied for any
surface $f$. This is possible only if the integrand functions have the
same value at every point of space. We thus arrive at the equation

$$[\nabla\mathbf{E}] = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t} \tag{53.4}$$

The equation we have obtained does not include the parameters of
the contour with whose consideration we began this section. It is 
natural to assume that this equation must be observed for any point 
of a field regardless of the presence in this field of a physically sepa- 
rated (particularly a conducting) contour. This assumption is known 
to be justified experimentally. 

Examination of Eq. (53.4) shows that a time-varying magnetic
field $\mathbf{B}$ must set up an electric field $\mathbf{E}$. Indeed,
for $[\nabla\mathbf{E}]$ to be non-zero, the presence of a
non-homogeneous (i.e. varying from point to point) field $\mathbf{E}$
is needed.
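
The equivalence of the integral and differential forms can be tried
out on a simple case. Assume, for illustration only, a homogeneous
field $B_z = B_0\cos\omega t$ and the axially symmetric vortical field
$\mathbf{E} = -(1/2c)(\partial B_z/\partial t)\,(-y, x, 0)$, whose curl
satisfies (53.4). The sketch below compares the circulation of
$\mathbf{E}$ around a circle of radius $R$ with $-(1/c)\,d\Phi/dt$.
The field configuration, the function names and the numerical values
are assumptions made for this example.

```python
import math

C = 2.998e10  # speed of light, cm/s (Gaussian units)

def dB_dt(t, B0, omega):
    """Time derivative of the homogeneous field B_z(t) = B0 * cos(omega * t)."""
    return -B0 * omega * math.sin(omega * t)

def E_field(x, y, t, B0, omega):
    """Assumed axially symmetric field with [nabla E]_z = -(1/c) dB_z/dt."""
    k = -dB_dt(t, B0, omega) / (2 * C)
    return (-k * y, k * x)

def circulation(R, t, B0, omega, segments=720):
    """Closed-contour sum of E dl around a circle of radius R (counterclockwise)."""
    total = 0.0
    dphi = 2 * math.pi / segments
    for s in range(segments):
        phi = (s + 0.5) * dphi
        x, y = R * math.cos(phi), R * math.sin(phi)
        Ex, Ey = E_field(x, y, t, B0, omega)
        dlx, dly = -R * math.sin(phi) * dphi, R * math.cos(phi) * dphi
        total += Ex * dlx + Ey * dly
    return total

def emf_from_flux(R, t, B0, omega, h=1e-7):
    """-(1/c) dPhi/dt with Phi = pi R^2 B_z(t), by a central difference in t."""
    flux = lambda tt: math.pi * R * R * B0 * math.cos(omega * tt)
    return -(flux(t + h) - flux(t - h)) / (2 * h) / C

R, t, B0, omega = 5.0, 3.0e-3, 100.0, 2 * math.pi * 50.0   # illustrative values
print(circulation(R, t, B0, omega), emf_from_flux(R, t, B0, omega))
```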

54. Displacement Current

In analysing the equations describing electromagnetic phenomena,
J. Maxwell gave attention to the fact that in the non-stationary case,
the equation

$$[\nabla\mathbf{H}] = \frac{4\pi}{c}\,\mathbf{j} \tag{54.1}$$

[see (52.10)] is incompatible at $\partial\rho/\partial t \neq 0$ with
the continuity equation

$$\nabla\mathbf{j} = -\frac{\partial\rho}{\partial t} \tag{54.2}$$

[see (51.1)]. To convince ourselves that this is true, let us take the
divergence of both sides of Eq. (54.1). Since the divergence of a curl
is always zero [see (XI.44)], we arrive at the conclusion that the
divergence of $\mathbf{j}$ and, consequently,
$\partial\rho/\partial t$ too, cannot be non-zero. But the conclusion
that $\partial\rho/\partial t$ always equals zero does not agree with
experiments: in non-stationary processes, the density of the charges
quite often varies with time.

Equations (54.1) and (54.2) can be brought into agreement by
adding in (54.1) to $\mathbf{j}$ a quantity (we shall denote it
$\mathbf{j}_d$) having the dimension of current density. This quantity
must be determined so that the condition

$$\nabla(\mathbf{j} + \mathbf{j}_d) = 0$$

will always be satisfied. It follows from this condition with a view to
(54.2) that the addend $\mathbf{j}_d$ must satisfy the relation

$$\nabla\mathbf{j}_d = -\nabla\mathbf{j} = \frac{\partial\rho}{\partial t} \tag{54.3}$$

Time differentiation of Eq. (45.3) yields the equation

$$\frac{\partial}{\partial t}(\nabla\mathbf{D}) = 4\pi\,\frac{\partial\rho}{\partial t}$$

or, changing the sequence of differentiating $\mathbf{D}$ with respect
to time and the coordinates,

$$\nabla\left(\frac{\partial\mathbf{D}}{\partial t}\right) = 4\pi\,\frac{\partial\rho}{\partial t}$$

Substituting the expression obtained from this equation for
$\partial\rho/\partial t$ into formula (54.3), we arrive at the equation

$$\nabla\mathbf{j}_d = \frac{1}{4\pi}\,\nabla\left(\frac{\partial\mathbf{D}}{\partial t}\right)$$

It follows from this equation that the expression

$$\mathbf{j}_d = \frac{1}{4\pi}\frac{\partial\mathbf{D}}{\partial t} \tag{54.4}$$

must be taken as $\mathbf{j}_d$ [in the general case the arbitrary
function of time $f(t)$ must be added to the right-hand side, but we
have assumed that it equals zero].

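Maxwell thus arrived at the conclusion that for a time-varying
field Eq. (54.1) must be written as

$$[\nabla\mathbf{H}] = \frac{4\pi}{c}\,(\mathbf{j} + \mathbf{j}_d)$$

or

$$[\nabla\mathbf{H}] = \frac{4\pi}{c}\,\mathbf{j} + \frac{1}{c}\frac{\partial\mathbf{D}}{\partial t} \tag{54.5}$$

He called the quantity $\mathbf{j}_d$ the density of the displacement
current and the sum $\mathbf{j} + \mathbf{j}_d$ the density of the
total current.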

A glance at Eq. (54.5) shows that a magnetic field may be produced
not only by conduction currents, but also by a time-varying
electric field. Hence, the introduction of the displacement current
made the fields $\mathbf{E}$ and $\mathbf{B}$ equivalent with respect
to the ability to produce each other.
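
A charging plane capacitor gives a familiar illustration of how the
displacement current closes the circuit. Assume, for this sketch only,
that the charge of a plate grows as $q = q_{\max}(1 - e^{-t/\tau})$ and
that edge effects are negligible, so that between the plates
$D = 4\pi q/S$, where $S$ is the plate area. Then formula (54.4),
integrated over the cross section of the gap, should give the same
current as $dq/dt$ in the feeding wire. The charging law, the function
names and the numbers are illustrative assumptions.

```python
import math

def plate_charge(t, q_max, tau):
    """Assumed charging law q(t) = q_max * (1 - exp(-t / tau))."""
    return q_max * (1.0 - math.exp(-t / tau))

def conduction_current(t, q_max, tau):
    """i = dq/dt in the wire feeding the plate."""
    return q_max / tau * math.exp(-t / tau)

def displacement_current(t, q_max, tau, area, h=1e-7):
    """j_d * area, with j_d = (1/4 pi) dD/dt and D = 4 pi q / area in the gap."""
    D = lambda tt: 4 * math.pi * plate_charge(tt, q_max, tau) / area
    dD_dt = (D(t + h) - D(t - h)) / (2 * h)
    return dD_dt / (4 * math.pi) * area

t, q_max, tau, area = 5.0e-4, 1.0, 1.0e-3, 10.0   # illustrative values
print(conduction_current(t, q_max, tau), displacement_current(t, q_max, tau, area))
```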


55. Maxwell's Equations 

The collection of Eqs. (45.3), (47.4), (53.4) and (54.5) forms a
system of Maxwell’s equations. It is customary practice to group these
equations in pairs. The equations

$$[\nabla\mathbf{E}] = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t} \tag{55.1}$$

and

$$\nabla\mathbf{B} = 0 \tag{55.2}$$

are known as the first pair of Maxwell’s equations, and the equations

$$[\nabla\mathbf{H}] = \frac{4\pi}{c}\,\mathbf{j} + \frac{1}{c}\frac{\partial\mathbf{D}}{\partial t} \tag{55.3}$$

and

$$\nabla\mathbf{D} = 4\pi\rho \tag{55.4}$$

as the second pair of equations. We must note that the first pair of
equations includes only the basic characteristics of a field, namely,
$\mathbf{E}$ and $\mathbf{B}$. The second pair contains the auxiliary
quantities $\mathbf{D}$ and $\mathbf{H}$.

Maxwell’s equations are the foundation of all electrodynamics. 
They play the same role in classical electrodynamics that Newton’s 
equations do in classical mechanics. 

The system of Maxwell’s equations includes eight scalar equations
[each of the vector equations (55.1) and (55.3) is equivalent to three
scalar ones] containing twelve unknown scalar functions (the
components of the vectors $\mathbf{E}$, $\mathbf{B}$, $\mathbf{D}$ and
$\mathbf{H}$). Therefore, Eqs. (55.1)-(55.4) by themselves are
insufficient for determining the electromagnetic fields in a
substance. In this sense, the system of Maxwell’s equations is
incomplete. To be able to analyse fields, Maxwell’s equations must
be supplemented with equations relating the quantities $\mathbf{D}$
and $\mathbf{j}$ to $\mathbf{E}$, and also $\mathbf{H}$ to
$\mathbf{B}$. These equations (sometimes called equations of state)
have the form [see (45.5) and (52.12)]

$$\mathbf{D} = \varepsilon\mathbf{E} \tag{55.5}$$

$$\mathbf{B} = \mu\mathbf{H} \tag{55.6}$$

$$\mathbf{j} = \sigma\mathbf{E} \tag{55.7}$$

(here $\sigma$ is the electrical conductivity of the medium). To solve
problems, we must know the characteristics of the medium,
$\varepsilon$, $\mu$, and $\sigma$, which are constants in the
simplest case.

We must note that Maxwell’s equations can be written so as not 
to contain the auxiliary quantities D and H. For this purpose, we 
must replace these quantities with their values from (45.2) and (52.9) 
in the second pair of equations. We invite our reader to convince 
himself that after such a replacement Maxwell’s equations become 

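$$[\nabla\mathbf{E}] = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}, \qquad \nabla\mathbf{B} = 0 \tag{55.8}$$

$$[\nabla\mathbf{B}] = \frac{4\pi}{c}\left(\mathbf{j} + c[\nabla\mathbf{M}] + \frac{\partial\mathbf{P}}{\partial t}\right) + \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t}, \qquad \nabla\mathbf{E} = 4\pi(\rho - \nabla\mathbf{P}) \tag{55.9}$$

To solve the system of equations (55.8), (55.9), we must know the
form of the functions P = P (E), M = M (B), and j = j (E).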

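In the following, we shall deal with electromagnetic fields
in homogeneous and isotropic media whose permittivity ε and
permeability μ are constant. Here ε and μ can be taken outside the signs
of the derivatives or, conversely, can be put inside these signs. Hence,
Eqs. (55.1)-(55.4) can be written as

$$[\nabla\mathbf{E}] = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}, \qquad \nabla\mathbf{B} = 0 \tag{55.10}$$

$$[\nabla\mathbf{B}] = \frac{4\pi\mu}{c}\mathbf{j} + \frac{\varepsilon\mu}{c}\frac{\partial\mathbf{E}}{\partial t}, \qquad \nabla\mathbf{E} = \frac{4\pi}{\varepsilon}\rho \tag{55.11}$$

In the absence of media that can be polarized and magnetized, ε
and μ equal unity, so that Maxwell's equations for a field in a
vacuum are as follows:

$$[\nabla\mathbf{E}] = -\frac{1}{c}\frac{\partial\mathbf{B}}{\partial t}, \qquad \nabla\mathbf{B} = 0 \tag{55.12}$$

$$[\nabla\mathbf{B}] = \frac{4\pi}{c}\mathbf{j} + \frac{1}{c}\frac{\partial\mathbf{E}}{\partial t}, \qquad \nabla\mathbf{E} = 4\pi\rho \tag{55.13}$$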

Since in the Gaussian system of units H coincides with B for a field 
in a vacuum, in this case we may write H instead of B in Eqs. (55.12) 
and (55.13). 
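
As an illustration of the vacuum equations (55.12) and (55.13), the
following sketch checks numerically that a plane wave with E along y
and B along z, travelling along x, satisfies them when ρ = 0 and j = 0.
The values of c and k, the step sizes, and all function names are
assumptions made for the example only; the two divergence equations
hold trivially here because the fields do not depend on y or z.

```python
import math

C = 3.0          # speed of light in arbitrary units (illustrative value)
K = 2.0          # wave number (illustrative value)
OMEGA = C * K    # vacuum dispersion relation, omega = c k


def e_y(x, t):
    """y-component of E for a plane wave travelling along x."""
    return math.cos(K * x - OMEGA * t)


def b_z(x, t):
    """z-component of B; in Gaussian units it has the same amplitude as E."""
    return math.cos(K * x - OMEGA * t)


def d_dx(func, x, t, h=1e-5):
    """Central-difference estimate of the partial derivative with respect to x."""
    return (func(x + h, t) - func(x - h, t)) / (2 * h)


def d_dt(func, x, t, h=1e-5):
    """Central-difference estimate of the partial derivative with respect to t."""
    return (func(x, t + h) - func(x, t - h)) / (2 * h)


def faraday_residual(x, t):
    """z-component of [curl E] + (1/c) dB/dt, which (55.12) requires to vanish."""
    return d_dx(e_y, x, t) + d_dt(b_z, x, t) / C


def ampere_residual(x, t):
    """y-component of [curl B] - (1/c) dE/dt with j = 0, see (55.13)."""
    return -d_dx(b_z, x, t) - d_dt(e_y, x, t) / C


if __name__ == "__main__":
    for x, t in [(0.0, 0.0), (0.4, 0.1), (1.3, 0.7)]:
        print(x, t, faraday_residual(x, t), ampere_residual(x, t))
```

Both residuals should be zero to within the finite-difference error;
changing OMEGA so that it no longer equals C * K makes them visibly
non-zero.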

56. Potentials of Electromagnetic Field 

In Sec. 47, taking advantage of the fact that ∇B = 0, we wrote
the magnetic induction as

$$\mathbf{B} = [\nabla\mathbf{A}] \tag{56.1}$$

where A is an auxiliary function known as the vector potential.
Expression (56.1) also holds for a time-varying field. In this case,
however, A must be considered as a function of not only the
coordinates, but also the time t, namely, A = A (r, t).

Let us substitute (56.1) into Eq. (55.1). The result is

$$[\nabla\mathbf{E}] = -\frac{1}{c}\frac{\partial}{\partial t}[\nabla\mathbf{A}] = -\frac{1}{c}\left[\nabla, \frac{\partial\mathbf{A}}{\partial t}\right]$$

This relation can be written as

$$\left[\nabla, \left(\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t}\right)\right] = 0$$

Since the curl of the vector E + (1/c) ∂A/∂t vanishes, this vector
can be written as the gradient of a function φ:

$$\mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} = -\nabla\varphi \tag{56.2}$$

The function φ is called the scalar potential of an electromagnetic
field. In the non-stationary case, it depends on r and on t. It can be
seen from (56.2) that the potentials φ and A have the same dimension.

By (56.2), we have

$$\mathbf{E} = -\nabla\varphi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} \tag{56.3}$$

The electric field strength in the general case is thus determined
not only by the scalar potential φ, but also by the vector potential
A. The second term in (56.3) is evidently due to electromagnetic
induction. For a stationary field, ∂A/∂t = 0, and formula (56.3)
transforms into (41.5).

Relations (56.1) and (56.3) express the magnetic and electric
fields in terms of the vector and scalar potentials.

Let us find equations that will allow us to calculate the potentials
A and φ for a field in a homogeneous and isotropic medium with
constant ε and μ. For this purpose, we shall substitute in Eqs. (55.11)
expression (56.3) for E and the curl of A for B:

$$[\nabla, [\nabla\mathbf{A}]] = \frac{4\pi\mu}{c}\mathbf{j} - \frac{\varepsilon\mu}{c}\nabla\left(\frac{\partial\varphi}{\partial t}\right) - \frac{\varepsilon\mu}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2}$$

$$\nabla(\nabla\varphi) + \frac{1}{c}\nabla\left(\frac{\partial\mathbf{A}}{\partial t}\right) = -\frac{4\pi}{\varepsilon}\rho$$

Taking into account that [∇, [∇A]] = ∇ (∇A) − ΔA [see (XI.45)],
and ∇ (∇φ) = Δφ, we can write the equations we have obtained as
follows:

$$\Delta\mathbf{A} - \frac{\varepsilon\mu}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} = -\frac{4\pi\mu}{c}\mathbf{j} + \nabla\left(\nabla\mathbf{A} + \frac{\varepsilon\mu}{c}\frac{\partial\varphi}{\partial t}\right) \tag{56.4}$$

$$\Delta\varphi = -\frac{4\pi}{\varepsilon}\rho - \frac{1}{c}\frac{\partial}{\partial t}(\nabla\mathbf{A}) \tag{56.5}$$

(in some terms of the equations we have changed the sequence of
differentiation with respect to the coordinates and time).

Equations (56.4) and (56.5) are exactly the ones we are looking
for to find A and φ. These equations are quite complicated. The fact
that they are mutually related (each of them contains both A and φ)
is especially unpleasant. We shall see below, however, that the
potentials can be selected so that these equations become greatly
simplified.

The potentials A and φ are determined non-uniquely. There is
consequently a certain freedom in choosing them. Particularly, an
arbitrary constant vector may be added to A, and an arbitrary constant
to φ without any change in the values of B and E. Naturally,
the potentials should be chosen in the most convenient way for the
given case. This most expedient choice of the potentials is known as
their gauging. We must note that we have already taken advantage
of the possibility of gauging in magnetostatics: in Sec. 47 we chose
A so that its divergence would be zero [see formula (47.7)].

Let us determine the most general form of the gauge transformations,
i.e. such transformations of the potentials A and φ at which
the fields E and B remain unchanged. The field B = [∇A] will not
change if we add to A the gradient of an arbitrary scalar function f
(the curl of the gradient is zero), i.e. pass over from A to A' equal
to A + ∇f:

$$\mathbf{A} \to \mathbf{A}' = \mathbf{A} + \nabla f \tag{56.6}$$

For the electric field E = −∇φ − (1/c) ∂A/∂t to remain unchanged
here, simultaneously with the transition (56.6) we must perform
the transition

$$\varphi \to \varphi' = \varphi - \frac{1}{c}\frac{\partial f}{\partial t} \tag{56.7}$$

where f is the same function as in (56.6). Indeed, the field E'
determined by the potentials A' and φ' will in this case be

$$\mathbf{E}' = -\nabla\varphi' - \frac{1}{c}\frac{\partial\mathbf{A}'}{\partial t} = -\nabla\varphi + \frac{1}{c}\nabla\frac{\partial f}{\partial t} - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} - \frac{1}{c}\frac{\partial(\nabla f)}{\partial t} = -\nabla\varphi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} = \mathbf{E}$$

[recall that ∇ (∂f/∂t) = ∂ (∇f)/∂t]. Hence, the transformation of the
potentials by formulas (56.6) and (56.7) does not change the value
of B and E.

The transformations (56.6) and (56.7) are gauge transformations 
of the most general kind. Since the fields B and E remain unchanged 
in these transformations, all the equations describing these fields 
must be invariant with respect to the gauge transformations. This 
invariance is called a gauge invariance. 
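
The gauge invariance expressed by (56.6) and (56.7) can be checked
numerically. The sketch below is only an illustration: the particular
potentials, the gauge function f (given through its gradient and time
derivative), the value of c, and all names are arbitrary assumptions.
It evaluates E and B by central differences from (56.1) and (56.3),
once for the original and once for the transformed potentials.

```python
import math

C = 3.0    # speed of light in arbitrary units (illustrative value)
H = 1e-5   # finite-difference step


def phi(x, y, z, t):
    """An arbitrary smooth scalar potential."""
    return x * y + t * z


def vec_a(x, y, z, t):
    """An arbitrary smooth vector potential."""
    return (y * t, z * x, math.sin(x) * t)


# Gauge function f = sin(x) * y * t + z**2, represented by its derivatives.
def grad_f(x, y, z, t):
    return (math.cos(x) * y * t, math.sin(x) * t, 2.0 * z)


def df_dt(x, y, z, t):
    return math.sin(x) * y


def phi_prime(x, y, z, t):
    """Transformed scalar potential, Eq. (56.7)."""
    return phi(x, y, z, t) - df_dt(x, y, z, t) / C


def vec_a_prime(x, y, z, t):
    """Transformed vector potential, Eq. (56.6)."""
    a, g = vec_a(x, y, z, t), grad_f(x, y, z, t)
    return tuple(ai + gi for ai, gi in zip(a, g))


def partial(func, point, axis):
    """Central difference of func(*point) along axis 0..3 (x, y, z, t)."""
    plus, minus = list(point), list(point)
    plus[axis] += H
    minus[axis] -= H
    fp, fm = func(*plus), func(*minus)
    if isinstance(fp, tuple):
        return tuple((p - m) / (2 * H) for p, m in zip(fp, fm))
    return (fp - fm) / (2 * H)


def fields(scalar_pot, vector_pot, point):
    """Return (E, B) computed from the potentials by (56.3) and (56.1)."""
    grad_phi = [partial(scalar_pot, point, k) for k in range(3)]
    da = [partial(vector_pot, point, k) for k in range(4)]  # da[k][i] = dA_i/dx_k
    e = tuple(-grad_phi[i] - da[3][i] / C for i in range(3))
    b = (da[1][2] - da[2][1], da[2][0] - da[0][2], da[0][1] - da[1][0])
    return e, b


if __name__ == "__main__":
    point = (0.3, -0.7, 1.1, 0.5)
    print(fields(phi, vec_a, point))
    print(fields(phi_prime, vec_a_prime, point))
```

The two printed pairs of fields should agree to within the
finite-difference error, although the potentials themselves differ.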

In practice, as we have already mentioned, the gauge that is the 
most expedient in each specific case is used. Particularly, we can 
choose the potentials so that the following condition will be observed: 

$$\nabla\mathbf{A} + \frac{\varepsilon\mu}{c}\frac{\partial\varphi}{\partial t} = 0 \tag{56.8}$$

It is called the Lorentz condition.

For a field in a vacuum, the Lorentz condition is

$$\nabla\mathbf{A} + \frac{1}{c}\frac{\partial\varphi}{\partial t} = 0 \tag{56.9}$$

We shall show that condition (56.8) can be satisfied by the proper
choice of the function f in formulas (56.6) and (56.7). For this
purpose, we shall introduce the values of A' and φ' determined by these
formulas into Eq. (56.8):

$$\nabla\mathbf{A} + \Delta f + \frac{\varepsilon\mu}{c}\frac{\partial\varphi}{\partial t} - \frac{\varepsilon\mu}{c^2}\frac{\partial^2 f}{\partial t^2} = 0$$

[∇ (∇f) = Δf]. Hence, we obtain an equation for finding the
function f:

$$\Delta f - \frac{\varepsilon\mu}{c^2}\frac{\partial^2 f}{\partial t^2} = F(\mathbf{r}, t) \tag{56.10}$$

where F (r, t) = −∇A − (εμ/c) ∂φ/∂t is a known function of r
and t. Introducing the function f obtained from the solution of this
equation into formulas (56.6) and (56.7), we shall find the values of
the potentials A' and φ' satisfying condition (56.8).

The gauging of potentials satisfying condition (56.8) is called
the Lorentz gauge. This gauge is the one used most often.

The Lorentz condition greatly restricts the set of potential values
suitable for describing a given field, but it nevertheless does not
make the choice of the potentials unique. Indeed, without violating
condition (56.8), we can perform the transformations

$$\mathbf{A} \to \mathbf{A}' = \mathbf{A} + \nabla\psi, \qquad \varphi \to \varphi' = \varphi - \frac{1}{c}\frac{\partial\psi}{\partial t} \tag{56.11}$$

(both sets of potentials, A, φ and A', φ', are assumed to satisfy
the Lorentz condition), where the function ψ is the solution of the
equation

$$\Delta\psi - \frac{\varepsilon\mu}{c^2}\frac{\partial^2\psi}{\partial t^2} = 0 \tag{56.12}$$

Indeed, introducing into the left-hand side of (56.8) the primed
potentials from (56.11) instead of A and φ, we obtain the expression

$$\nabla\mathbf{A} + \Delta\psi + \frac{\varepsilon\mu}{c}\frac{\partial\varphi}{\partial t} - \frac{\varepsilon\mu}{c^2}\frac{\partial^2\psi}{\partial t^2} = \left(\nabla\mathbf{A} + \frac{\varepsilon\mu}{c}\frac{\partial\varphi}{\partial t}\right) + \left(\Delta\psi - \frac{\varepsilon\mu}{c^2}\frac{\partial^2\psi}{\partial t^2}\right)$$

which owing to (56.8) and (56.12) equals zero. Therefore, if the
potentials A and φ belong to the Lorentz gauge, the potentials A' and φ'
determined by transformations (56.11) [with ψ obeying (56.12)] belong
to the same gauge. This allows us to impose on the potentials an
additional condition besides condition (56.8). For instance, we can
require that the potential φ vanish. For this purpose, according to the
second of Eqs. (56.11), it is sufficient to choose the function ψ so that
its time derivative will be cφ.

We can also adopt the requirement that 

$$\nabla\mathbf{A} = 0 \tag{56.13}$$

as an additional condition. It follows from (56.11) that
∇A' = ∇A + Δψ. Therefore, for the requirement that ∇A' = 0 to be
satisfied, the equality

$$\Delta\psi = -\nabla\mathbf{A}$$

must be observed. At the same time, from (56.12), we have

$$\Delta\psi = \frac{\varepsilon\mu}{c^2}\frac{\partial^2\psi}{\partial t^2}$$

Therefore, if we take as ψ the solution of the equation

$$\frac{\varepsilon\mu}{c^2}\frac{\partial^2\psi}{\partial t^2} = -\nabla\mathbf{A}$$

and introduce this solution into (56.11), we obtain values of A'
and φ' satisfying both the Lorentz condition and the requirement
(56.13).

The Lorentz gauge satisfying the additional condition (56.13)
is called the Coulomb (or transverse) gauge. This is the gauge we
used in Chap. IX [see (47.7)]. Examination of (56.5) shows that in
the Coulomb gauge the scalar potential satisfies Poisson's equation

$$\Delta\varphi = -\frac{4\pi\rho}{\varepsilon}$$

[see Eq. (45.13)], i.e. is a Coulomb potential. This explains the
origin of the name "Coulomb gauge".


57. D'Alembert's Equation 


When condition (56.8) is satisfied, the last term in Eq. (56.4)
vanishes. In addition, the time derivative of ∇A has the value
−(εμ/c) ∂²φ/∂t². Consequently, Eqs. (56.4) and (56.5) become

$$\Delta\mathbf{A} - \frac{\varepsilon\mu}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} = -\frac{4\pi\mu}{c}\mathbf{j} \tag{57.1}$$

$$\Delta\varphi - \frac{\varepsilon\mu}{c^2}\frac{\partial^2\varphi}{\partial t^2} = -\frac{4\pi}{\varepsilon}\rho \tag{57.2}$$

Hence, instead of two mutually related equations, we have obtained
two independent equations, the equations for A and φ having
acquired a similar form.

A differential equation of the kind

$$\Delta f - \frac{\varepsilon\mu}{c^2}\frac{\partial^2 f}{\partial t^2} = F(\mathbf{r}, t) \tag{57.3}$$

is known as d'Alembert's equation [compare with (56.10)]. It can
be written in a very compact form if we introduce the d'Alembertian
operator

$$\Box = \Delta - \frac{\varepsilon\mu}{c^2}\frac{\partial^2}{\partial t^2} = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} - \frac{\varepsilon\mu}{c^2}\frac{\partial^2}{\partial t^2} \tag{57.4}$$

Now Eq. (57.3) becomes

$$\Box f = F(\mathbf{r}, t) \tag{57.5}$$

In the stationary case, the time derivatives vanish, and d'Alembert's
equation transforms into Poisson's one [see Eqs. (45.13) and
(52.18)].

When we use the symbol (57.4), Eqs. (57.1) and (57.2) become

$$\Box\mathbf{A} = -\frac{4\pi\mu}{c}\mathbf{j} \tag{57.6}$$

$$\Box\varphi = -\frac{4\pi\rho}{\varepsilon} \tag{57.7}$$

From a mathematical viewpoint, Eqs. (57.6) and (57.7) are simpler
than Maxwell's equations. It is therefore simpler to calculate the
potentials A and φ than the fields B and E directly. When the
potentials are known, there is no great difficulty in finding the fields by
formulas (56.1) and (56.3). This circumstance justifies the
introduction of the auxiliary quantities A and φ. In addition, as we shall
see in the following chapter, the introduction of the potentials allows 
us to give the equations of electrodynamics a very compact and re- 
fined form. 
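
To make the structure of d'Alembert's equation more tangible, the
sketch below checks numerically that, in one space dimension and with
F = 0, any profile travelling at the speed c/√(εμ) satisfies the
homogeneous equation. The Gaussian profile, the values of c, ε, μ, and
all names are assumptions chosen for the illustration.

```python
import math

C = 3.0      # speed of light in arbitrary units (illustrative value)
EPS = 2.25   # permittivity of the medium (illustrative value)
MU = 1.0     # permeability of the medium (illustrative value)
V = C / math.sqrt(EPS * MU)   # propagation speed implied by the operator (57.4)


def profile(s):
    """An arbitrary smooth wave shape."""
    return math.exp(-s * s)


def wave(x, t):
    """A disturbance travelling along +x at speed V."""
    return profile(x - V * t)


def second_diff(func, x, t, axis, h=1e-3):
    """Central second difference with respect to 'x' or 't'."""
    if axis == "x":
        return (func(x + h, t) - 2 * func(x, t) + func(x - h, t)) / h**2
    return (func(x, t + h) - 2 * func(x, t) + func(x, t - h)) / h**2


def dalembert_residual(func, x, t):
    """Left-hand side of (57.3) in one space dimension."""
    return (second_diff(func, x, t, "x")
            - EPS * MU / C**2 * second_diff(func, x, t, "t"))


if __name__ == "__main__":
    print(dalembert_residual(wave, 0.3, 0.1))
```

The printed residual should be close to zero; a profile moving at any
other speed (for instance at c in a medium with εμ ≠ 1) leaves a
residual of order unity.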

58. Density and Flux of Electromagnetic 
Field Energy 

Experiments show that an electromagnetic field has an energy
that is distributed in space with a certain density w and can flow from
one place to another. This energy can also transform into other kinds 
of energy, for example, it can go to do work on the particles of a 
substance. 

Let us separate in a substance in which there is a macroscopic
electromagnetic field the volume V confined by the surface f. This
volume contains energy of the field equal to

$$W = \int_V w\,dV \tag{58.1}$$

The energy can flow outward from the volume V through the
surface f confining it. If we introduce the vector S of the energy flux
density, the flux of energy flowing outward through f can be
written as

$$\Phi = \oint_f S_n\,df = \int_V \nabla\mathbf{S}\,dV \tag{58.2}$$

(we have used the Ostrogradsky-Gauss theorem).

Let us find the work done in unit time by the field forces on the
particles of a substance. The forces of a field do the following work
on a particle having the charge e_a and travelling at the velocity v_a
in unit time:

$$N_a = e_a\left\{\mathbf{E} + \frac{1}{c}[\mathbf{v}_a\mathbf{B}]\right\}\mathbf{v}_a = e_a\mathbf{v}_a\mathbf{E}$$

(the scalar triple product v_a [v_a B] vanishes). Summating this
expression over all the particles confined in unit volume, we obtain the
density of the power developed by the forces of the electromagnetic
field in doing work on the particles of the substance. Designating
the power density by N, we can write

$$N = \sum_a N_a = \mathbf{E}\sum_a e_a\mathbf{v}_a$$

The sum Σ e_a v_a taken over unit volume is the density j of the
electric current (if all the particles are identical, travel at the same
velocity, and their number in unit volume is n, the sum Σ e_a v_a
transforms into the expression j = env known from the general course
of physics). Hence,

$$N = \mathbf{E}\mathbf{j} \tag{58.3}$$

The energy W contained in the volume V may diminish because
of energy flowing outward through the surface f and because of work
being done on the particles of the substance. Consequently, the
following relation must be observed:

$$-\frac{dW}{dt} = \Phi + \int_V N\,dV$$

(remember that N is the power density, i.e. power developed per unit
volume). Substitution into the above equation of expressions (58.1)-
(58.3) for W, Φ, and N yields

$$-\frac{d}{dt}\int_V w\,dV = \int_V \nabla\mathbf{S}\,dV + \int_V \mathbf{E}\mathbf{j}\,dV$$

Let us exchange the places of time differentiation and integration
over the coordinates in the integral containing w, and
also combine all three integrals into one. The result is

$$\int_V \left(\frac{\partial w}{\partial t} + \nabla\mathbf{S} + \mathbf{E}\mathbf{j}\right) dV = 0 \tag{58.4}$$

(we have used the symbol of the partial derivative because w in the
general case is a function not only of time, but also of the
coordinates).

Condition (58.4) must be observed for any arbitrarily chosen vol- 
ume V. We thus conclude that the integrand function must vanish 
at every point. Consequently, we arrive at a differential equation 
that can be written as 

$$\mathbf{E}\mathbf{j} = -\frac{\partial w}{\partial t} - \nabla\mathbf{S} \tag{58.5}$$

The energy density w and the energy flux density S are functions
of quantities characterizing a field. To find the form of these
functions, let us attempt to transform the expression for Ej so that it
would become the sum of two addends, one of which would be the
time derivative of a scalar quantity, which we would be able to
identify with w, and the second, the divergence of a vector quantity,
which we would be able to identify with S.

Using Maxwell's equation (55.3), let us express j in terms of the
field characteristics H and D:


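$$\mathbf{j} = \frac{c}{4\pi}[\nabla\mathbf{H}] - \frac{1}{4\pi}\frac{\partial\mathbf{D}}{\partial t}$$

Now let us find the scalar product of this expression and E:

$$\mathbf{E}\mathbf{j} = \frac{c}{4\pi}\mathbf{E}[\nabla\mathbf{H}] - \frac{1}{4\pi}\mathbf{E}\frac{\partial\mathbf{D}}{\partial t} \tag{58.6}$$

By formula (XI.28), we have ∇ [EH] = H [∇E] − E [∇H], whence
E [∇H] = H [∇E] − ∇ [EH]. Introducing this value into (58.6),
we obtain

$$\mathbf{E}\mathbf{j} = -\frac{c}{4\pi}\nabla[\mathbf{E}\mathbf{H}] + \left\{\frac{c}{4\pi}\mathbf{H}[\nabla\mathbf{E}] - \frac{1}{4\pi}\mathbf{E}\frac{\partial\mathbf{D}}{\partial t}\right\} \tag{58.7}$$

Using Maxwell's equation (55.1), let us substitute −(1/c) ∂B/∂t for
[∇E]. As a result, the expression in braces becomes

$$\{\ldots\} = -\frac{1}{4\pi}\left\{\mathbf{H}\frac{\partial\mathbf{B}}{\partial t} + \mathbf{E}\frac{\partial\mathbf{D}}{\partial t}\right\}$$

Finally, let us use relations (55.5) and (55.6):

$$\{\ldots\} = -\frac{1}{4\pi}\left\{\mathbf{H}\frac{\partial(\mu\mathbf{H})}{\partial t} + \mathbf{E}\frac{\partial(\varepsilon\mathbf{E})}{\partial t}\right\} = -\frac{\partial}{\partial t}\left(\frac{\varepsilon E^2}{8\pi} + \frac{\mu H^2}{8\pi}\right)$$

(we have assumed that ε and μ are time-independent).
Formula (58.7) can thus be written as follows:

$$\mathbf{E}\mathbf{j} = -\frac{\partial}{\partial t}\left(\frac{\varepsilon E^2 + \mu H^2}{8\pi}\right) - \nabla\left(\frac{c}{4\pi}[\mathbf{E}\mathbf{H}]\right)$$

A comparison of the obtained relation with formula (58.5) gives for
w and S the expressions

$$w = \frac{\varepsilon E^2 + \mu H^2}{8\pi} \tag{58.8}$$

$$\mathbf{S} = \frac{c}{4\pi}[\mathbf{E}\mathbf{H}] \tag{58.9}$$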


The vector S determined by formula (58.9) is called Poynting’s 
vector. 
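
The expressions for w and S lend themselves to a quick numerical
illustration. The sketch below, written under the assumption of a
plane wave in a vacuum with E along y, H along z and equal amplitudes
(Gaussian units), evaluates the energy density and Poynting's vector
and shows that the energy flux equals cw and is directed along x. The
function names and the sample amplitude are illustrative only.

```python
import math

C = 2.99792458e10  # speed of light, cm/s (Gaussian units)


def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Vector product [a b] of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def energy_density(e, h, eps=1.0, mu=1.0):
    """w = (eps E^2 + mu H^2) / (8 pi)."""
    return (eps * dot(e, e) + mu * dot(h, h)) / (8 * math.pi)


def poynting(e, h):
    """S = (c / 4 pi) [E H]."""
    return tuple(C / (4 * math.pi) * comp for comp in cross(e, h))


if __name__ == "__main__":
    e_field = (0.0, 2.0, 0.0)   # E along y (sample amplitude)
    h_field = (0.0, 0.0, 2.0)   # H along z, same amplitude
    w = energy_density(e_field, h_field)
    s = poynting(e_field, h_field)
    print(w, s, s[0] / (C * w))
```

The last printed number is the ratio of the x-component of S to cw,
which should equal unity for this configuration.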

We must note that expression (58.8) includes both the proper
energy of the field equal to

$$w_0 = \frac{E^2 + H^2}{8\pi}$$

and the energy spent on polarization and magnetization of the
medium in producing the field.


59. Momentum of Electromagnetic Field

It follows from the existence of the pressure of light that an
electromagnetic field has not only energy, but also momentum. The
momentum, like the energy, can "flow" from one place to another.
This process can be characterized by introducing the concepts of the
flux and flux density of the momentum.

The momentum flux, unlike the energy flux, is a vector, and not a
scalar. Consequently, the density of the momentum flux must be a
quantity such that when multiplied by the vector df (here df is an
element of the surface) it yields a vector. It is shown in Appendix X
that the scalar product of a second-rank tensor and a vector is a
vector. We thus conclude that the density of the momentum flux is
a tensor. Let us designate the components of this tensor by the
symbol σ_ik. Therefore, the flux of the i-th component of the momentum
through the area df is determined by the expression

$$\sum_k \sigma_{ik}\,df_k$$

and the flux of the momentum vector by

$$\sum_i \mathbf{e}_i \sum_k \sigma_{ik}\,df_k$$

In a system consisting of free charged particles and an electromag- 
netic field, the total momentum, which is the sum of that of the 
particles and that of the field, must be conserved. Hence, denoting 
the total momentum of the particles by the symbol P, and the den- 
sity of the momentum (i.e. the momentum of unit volume) of an 
electromagnetic field by the symbol g, we can write 

$$\frac{d}{dt}\left(\mathbf{P} + \int_V \mathbf{g}\,dV\right) = \sum_i \mathbf{e}_i \oint_f \sum_k \sigma_{ik}\,df_k$$

The left-hand side gives the rate of growth of the total momentum
contained in the volume V, and the right-hand side, the flux of the
momentum of the field flowing into the volume V through the
surface f confining it. We assume that no particles intersect this
surface so that no momentum is carried across it by the particles.

Let us write the above relation as follows:

$$\frac{d\mathbf{P}}{dt} = -\int_V \frac{\partial\mathbf{g}}{\partial t}\,dV + \sum_i \mathbf{e}_i \oint_f \sum_k \sigma_{ik}\,df_k \tag{59.1}$$

The rate of the change in the momentum of a particle is
determined by the force acting on the particle:

$$\frac{d\mathbf{p}}{dt} = e\mathbf{E} + \frac{e}{c}[\mathbf{v}\mathbf{B}]$$

Summation of this expression over the particles contained in a unit
volume yields

$$\frac{d\mathbf{p}_0}{dt} = \rho\mathbf{E} + \frac{1}{c}[\mathbf{j}\mathbf{B}]$$

where p_0 is the density of the momentum of the particles. Finally,
integrating this expression over the entire volume of the system,
we find the rate of the change in the total momentum of the
particles:

$$\frac{d\mathbf{P}}{dt} = \int_V \rho\mathbf{E}\,dV + \frac{1}{c}\int_V [\mathbf{j}\mathbf{B}]\,dV \tag{59.2}$$


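Let us use Maxwell's equations to exclude ρ and j from this
expression. Assume that the medium containing the particles and the
field is homogeneous and isotropic with constant ε and μ. It can be seen
from Eqs. (55.11) that

$$\rho = \frac{\varepsilon}{4\pi}\nabla\mathbf{E}, \qquad \mathbf{j} = \frac{c}{4\pi\mu}[\nabla\mathbf{B}] - \frac{\varepsilon}{4\pi}\frac{\partial\mathbf{E}}{\partial t}$$

Substitution of these values into (59.2) yields

$$\frac{d\mathbf{P}}{dt} = \frac{\varepsilon}{4\pi}\int \mathbf{E}\,\nabla\mathbf{E}\,dV + \frac{1}{4\pi\mu}\int [[\nabla\mathbf{B}], \mathbf{B}]\,dV - \frac{\varepsilon}{4\pi c}\int \left[\frac{\partial\mathbf{E}}{\partial t}, \mathbf{B}\right] dV \tag{59.3}$$

We shall transform Eq. (59.3) using the relation

$$\frac{\partial}{\partial t}[\mathbf{E}\mathbf{B}] = \left[\frac{\partial\mathbf{E}}{\partial t}, \mathbf{B}\right] + \left[\mathbf{E}, \frac{\partial\mathbf{B}}{\partial t}\right]$$

whence

$$\left[\frac{\partial\mathbf{E}}{\partial t}, \mathbf{B}\right] = \frac{\partial}{\partial t}[\mathbf{E}\mathbf{B}] - \left[\mathbf{E}, \frac{\partial\mathbf{B}}{\partial t}\right]$$

We substitute −c [∇E] for ∂B/∂t here according to the first of the
equations (55.10). The result is

$$\left[\frac{\partial\mathbf{E}}{\partial t}, \mathbf{B}\right] = \frac{\partial}{\partial t}[\mathbf{E}\mathbf{B}] + c\,[\mathbf{E}, [\nabla\mathbf{E}]]$$

Using the value obtained in (59.3), we arrive at the expression

$$\frac{d\mathbf{P}}{dt} = \frac{\varepsilon}{4\pi}\int \mathbf{E}\,\nabla\mathbf{E}\,dV + \frac{1}{4\pi\mu}\int [[\nabla\mathbf{B}], \mathbf{B}]\,dV - \frac{\varepsilon}{4\pi c}\int \frac{\partial}{\partial t}[\mathbf{E}\mathbf{B}]\,dV - \frac{\varepsilon}{4\pi}\int [\mathbf{E}, [\nabla\mathbf{E}]]\,dV$$

Let us make the substitution B = μH in the second and third
integrals and, in addition, exchange the places of the multipliers in the
second integral (which will cause the sign to be reversed):

$$\frac{d\mathbf{P}}{dt} = \frac{\varepsilon}{4\pi}\int \mathbf{E}\,\nabla\mathbf{E}\,dV - \frac{\mu}{4\pi}\int [\mathbf{H}, [\nabla\mathbf{H}]]\,dV - \frac{\varepsilon\mu}{4\pi c}\int \frac{\partial}{\partial t}[\mathbf{E}\mathbf{H}]\,dV - \frac{\varepsilon}{4\pi}\int [\mathbf{E}, [\nabla\mathbf{E}]]\,dV$$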

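To make this expression symmetric with respect to E and H, let
us add (μ/4π) ∫ H ∇H dV to its right-hand side. This will not change
the expression because this term is zero [μ∇H = ∇ (μH) = ∇B is
zero everywhere]. Finally, grouping the terms properly, we obtain

$$\frac{d\mathbf{P}}{dt} = -\int_V \frac{\partial}{\partial t}\left\{\frac{\varepsilon\mu}{4\pi c}[\mathbf{E}\mathbf{H}]\right\} dV + \frac{1}{4\pi}\int_V \left\{\varepsilon\mathbf{E}\,\nabla\mathbf{E} + \mu\mathbf{H}\,\nabla\mathbf{H} - \varepsilon[\mathbf{E}, [\nabla\mathbf{E}]] - \mu[\mathbf{H}, [\nabla\mathbf{H}]]\right\} dV \tag{59.4}$$

The second integral, as we shall see below, can be transformed into
an integral over the surface f confining the volume V. Hence,

$$\frac{d\mathbf{P}}{dt} = -\int_V \frac{\partial}{\partial t}\left\{\frac{\varepsilon\mu}{4\pi c}[\mathbf{E}\mathbf{H}]\right\} dV + \text{integral over surface } f \tag{59.5}$$

A comparison of the found relation with (59.1) allows us to draw
the conclusion that the density of the momentum of an
electromagnetic field is determined by the expression

$$\mathbf{g} = \frac{\varepsilon\mu}{4\pi c}[\mathbf{E}\mathbf{H}] \tag{59.6}$$

Taking (58.9) into account, we can write that

$$\mathbf{g} = \frac{\varepsilon\mu}{c^2}\mathbf{S} \tag{59.7}$$

where S is the Poynting vector. For a vacuum, this relation is

$$\mathbf{g} = \frac{1}{c^2}\mathbf{S} \tag{59.8}$$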

We must note that expression (59.7), in addition to the proper
momentum of a field, includes the momentum of the bound charges
entering the composition of the medium in which the field has been
produced. To obtain the momentum of the field alone, we must
understand P in formula (59.1) to be the mechanical momentum not
only of the free charges, but also of the bound ones. Now we would
have to take not the averaged macroscopic field, but the microscopic
field as E and B and correspondingly use Maxwell's equations for
a field in a vacuum in the transformations 1. This is equivalent to
assuming in all the formulas of the present section that ε = μ = 1.
As a result, we would arrive at formula (59.8). Consequently, the
density of the momentum of only a field in all cases (both in a
vacuum and in a substance) is determined by formula (59.8). A
comparison of this formula with formula (40.36) shows that there is
exactly the same relation between the densities of the energy and
momentum fluxes for an electromagnetic field as we obtained in
Sec. 40 for an arbitrary system.

1 We did not do this from the very beginning to obtain a more general
expression for the Maxwell stress tensor (see below) that is also suitable
for the field in a medium.

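Now let us consider the second integral in formula (59.4), i.e. the
integral

$$\frac{1}{4\pi}\int \left\{\varepsilon\mathbf{E}\,\nabla\mathbf{E} + \mu\mathbf{H}\,\nabla\mathbf{H} - \varepsilon[\mathbf{E}, [\nabla\mathbf{E}]] - \mu[\mathbf{H}, [\nabla\mathbf{H}]]\right\} dV \tag{59.9}$$

We shall attempt to transform it into a surface integral.

The integrand contains two similar expressions of the kind

$$\mathbf{a}\,\nabla\mathbf{a} - [\mathbf{a}, [\nabla\mathbf{a}]]$$

One of them contains E instead of a, and the other H.

Assuming in the formula

$$\nabla(\mathbf{a}\mathbf{b}) = [\mathbf{a}, [\nabla\mathbf{b}]] + [\mathbf{b}, [\nabla\mathbf{a}]] + (\mathbf{a}\nabla)\mathbf{b} + (\mathbf{b}\nabla)\mathbf{a}$$

[see (XI.37)] that b = a, we obtain

$$\nabla a^2 = 2[\mathbf{a}, [\nabla\mathbf{a}]] + 2(\mathbf{a}\nabla)\mathbf{a} \tag{59.10}$$

Let us find the value of the expression (∇a) b in which it is
assumed that ∇ acts on both factors following it. According to the general
rule for calculating such expressions, we have

$$(\nabla\mathbf{a})\mathbf{b} = (\nabla_a\mathbf{a})\mathbf{b} + (\nabla_b\mathbf{a})\mathbf{b} = \mathbf{b}\,\nabla\mathbf{a} + (\mathbf{a}\nabla)\mathbf{b}$$

or, assuming that b = a,

$$(\nabla\mathbf{a})\mathbf{a} = \mathbf{a}\,\nabla\mathbf{a} + (\mathbf{a}\nabla)\mathbf{a}$$

After finding (a∇) a from the last expression and introducing its
value into formula (59.10), we obtain

$$\nabla a^2 = 2[\mathbf{a}, [\nabla\mathbf{a}]] + 2(\nabla\mathbf{a})\mathbf{a} - 2\mathbf{a}\,\nabla\mathbf{a}$$

whence

$$\mathbf{a}\,\nabla\mathbf{a} - [\mathbf{a}, [\nabla\mathbf{a}]] = (\nabla\mathbf{a})\mathbf{a} - \frac{1}{2}\nabla a^2$$

Applying such a transformation to the integral (59.9), we can
write it as follows:

$$\frac{1}{4\pi}\int \left\{\varepsilon(\nabla\mathbf{E})\mathbf{E} + \mu(\nabla\mathbf{H})\mathbf{H} - \frac{1}{2}\nabla(\varepsilon E^2 + \mu H^2)\right\} dV$$

In this integral, the operator ∇ acts on all the functions following it.
Consequently, with the aid of the transformation

$$dV\,\nabla \to d\mathbf{f}$$

[see (XI.65)], we can transform this integral into a surface one:

$$\frac{1}{4\pi}\oint \left\{\varepsilon\mathbf{E}(\mathbf{E}\,d\mathbf{f}) + \mu\mathbf{H}(\mathbf{H}\,d\mathbf{f}) - \frac{1}{2}(\varepsilon E^2 + \mu H^2)\,d\mathbf{f}\right\} \tag{59.11}$$

[since df does not have the properties of a differential operator, we
can exchange the places of the factors in expressions such as (df a) a].

Let us develop expression (59.11) using the components of the
vectors in it. The result is

$$\sum_i \mathbf{e}_i \frac{1}{4\pi}\oint \left\{\varepsilon E_i \sum_k E_k\,df_k + \mu H_i \sum_k H_k\,df_k - \frac{1}{2}(\varepsilon E^2 + \mu H^2)\,df_i\right\}$$

Let us write df_i in the last term as Σ_k δ_ik df_k and factor out df_k.
Let us also substitute D_k for expressions of the kind εE_k, and B_k for
μH_k. As a result, we arrive at the expression

$$\sum_i \mathbf{e}_i \oint \sum_k \frac{1}{4\pi}\left\{E_i D_k + H_i B_k - \frac{1}{2}(\mathbf{E}\mathbf{D} + \mathbf{H}\mathbf{B})\,\delta_{ik}\right\} df_k$$

that coincides with the last term of formula (59.1) if we assume that

$$\sigma_{ik} = \frac{1}{4\pi}\left\{E_i D_k + H_i B_k - \frac{1}{2}(\mathbf{E}\mathbf{D} + \mathbf{H}\mathbf{B})\,\delta_{ik}\right\} \tag{59.12}$$

We established at the beginning of this section that the tensor σ_ik
characterizes the density of the momentum flux (see also Sec. 40).
The tensor σ_ik whose components are determined by formula (59.12)
is called the Maxwell stress tensor.

To underline the symmetrical nature of the tensor σ_ik, its
components are sometimes written as

$$\sigma_{ik} = \frac{1}{8\pi}\left\{E_i D_k + E_k D_i + H_i B_k + H_k B_i - (\mathbf{E}\mathbf{D} + \mathbf{H}\mathbf{B})\,\delta_{ik}\right\}$$

For a field in a vacuum, formula (59.12) is simplified as follows:

$$\sigma_{ik} = \frac{1}{4\pi}\left\{E_i E_k + B_i B_k - \frac{1}{2}(E^2 + B^2)\,\delta_{ik}\right\} \tag{59.13}$$

The tensor σ_ik allows us to reduce the problem of finding the force
acting on a certain volume of a substance in an electromagnetic
field to the calculation of the surface integral

$$\sum_i \mathbf{e}_i \oint_f \sum_k \sigma_{ik}\,df_k$$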
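
As a hedged illustration of formulas (59.13) and (59.8), the sketch
below evaluates the vacuum stress tensor and the momentum density for
given field vectors. The function names and sample fields are
assumptions of the example. Two properties follow directly from
(59.13) and can be checked numerically: the tensor is symmetric, and
its trace equals −(E² + B²)/8π.

```python
import math

C = 2.99792458e10  # speed of light, cm/s (Gaussian units)


def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Vector product [a b] of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def stress_tensor(e, b):
    """Components sigma_ik of (59.13) for a field in a vacuum."""
    half_norm = 0.5 * (dot(e, e) + dot(b, b))
    return [
        [(e[i] * e[k] + b[i] * b[k] - (half_norm if i == k else 0.0))
         / (4 * math.pi)
         for k in range(3)]
        for i in range(3)
    ]


def momentum_density(e, b):
    """g = [E B] / (4 pi c): (59.8) combined with (58.9) for H = B."""
    return tuple(comp / (4 * math.pi * C) for comp in cross(e, b))


if __name__ == "__main__":
    # Plane wave in a vacuum travelling along x: E along y, B along z.
    e_field, b_field = (0.0, 2.0, 0.0), (0.0, 0.0, 2.0)
    for row in stress_tensor(e_field, b_field):
        print(row)
    print(momentum_density(e_field, b_field))
```

For the plane wave used in the usage lines, the only non-zero
component is σ_xx = −w, where w is the energy density of the wave, and
the momentum density is directed along x with magnitude w/c.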


EQUATIONS OF ELECTRODYNAMICS
IN THE FOUR-DIMENSIONAL
FORM


60. Four-Potential 1

According to the principle of relativity, the equations of
electrodynamics, like all other equations expressing nature's laws, must be
relativistically invariant, i.e. retain their form in Lorentz
transformations (when passing from one inertial reference frame to
another). Direct verification shows that Maxwell's equations meet this
requirement. We shall choose a different way, however: we shall
show that the equations of electrodynamics can be written in the
four-dimensional form as relations between four-vectors and
four-tensors, whence their relativistic invariance will follow.

Our starting point will be the thesis adopted in electrodynamics
(in accordance with experiments) that the electric charge is invariant,
i.e. that the magnitude of the charge of a particle is the same in all
inertial reference frames. It thus follows that the quantity ρ dV is
also invariant:

$$\rho\,dV = \rho\,dx^1\,dx^2\,dx^3 = \text{inv} \tag{60.1}$$

We know that the three-dimensional volume dV is not invariant
[see formula (35.17)]. We thus conclude from (60.1) that the charge
density ρ is also not invariant, but changes in a transition from one
reference frame to another according to a definite law. To establish
this law, we shall take into consideration that a four-dimensional
volume is invariant:

$$dV^* = dx^0\,dx^1\,dx^2\,dx^3 = c\,dt\,dV = \text{inv} \tag{60.2}$$

Indeed, in passing over to another reference frame, dt and dV
transform by the formulas

$$dt' = \frac{dt}{\sqrt{1 - v^2/c^2}}, \qquad dV' = dV\sqrt{1 - v^2/c^2}$$

so that dt' dV' = dt dV.


1 In this chapter, we treat fields in a vacuum, i.e. we assume that ε = 1 and
μ = 1.

We advise our reader to look through Appendix XII and Chap. VII before
beginning to read this chapter.

A comparison of (60.1) and (60.2) shows that ρ transforms
according to the same law as dx⁰, i.e. like the time component of a
four-vector.

It is shown in Appendix XII that the contravariant components
of the Hamiltonian four-operator are

$$\nabla^\mu = \left(\frac{1}{c}\frac{\partial}{\partial t},\ -\nabla\right)$$

[see formula (XII.45)].

By (XII.38), the symbolic square of the vector ∇^μ is

$$\nabla^{*2} = \frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \Delta \tag{60.3}$$

where Δ is the three-dimensional Laplacian operator.

A comparison of (60.3) and (57.4) shows that the d'Alembertian
operator for a field in a vacuum (ε = 1, μ = 1) differs from ∇*²
only in its sign:

$$\nabla^{*2} = -\Box \tag{60.4}$$

It follows from everything said above that Eq. (57.7) can be
written as

$$\nabla^{*2}\varphi = 4\pi\rho \tag{60.5}$$

We have ascertained above that ρ has the properties of the time
component of a four-vector, and ∇*², like the square of any
four-vector, behaves in Lorentz transformations like an invariant.
Taking this into account, we conclude on the basis of (60.5) that the
potential φ must transform according to the same law as ρ, i.e. as a
four-vector time component.

Consider the current density vector j = ρv. Its components are

$$j^k = \rho v^k = \rho\frac{dx^k}{dt} \qquad (k = 1, 2, 3) \tag{60.6}$$

In transformations, ρ behaves like ct or c dt, so that the ratio ρ/dt
is invariant. Consequently, j^k will behave like dx^k (where
k = 1, 2, 3), i.e. like space components of a four-vector.

Therefore, in transformations of the coordinates, ρ behaves like
a four-vector time component, and the quantities j^k like four-vector
space components. Hence, multiplying ρ by the scalar c (to obtain a
quantity of the same dimension as j^k has), we can combine ρ and j
into a single four-vector called the charge-current or simply the
current four-vector. Its components are

$$j^0 = c\rho, \quad j^1 = j_x, \quad j^2 = j_y, \quad j^3 = j_z$$

which can be written briefly as follows:

$$j^\mu = (c\rho,\ \mathbf{j}) \tag{60.7}$$

We must note that the component j⁰ = cρ can be written like the
components (60.6) as

$$j^0 = \rho\frac{dx^0}{dt}$$

(x⁰ = ct). Therefore, the components of the current four-vector can
be determined as follows:

$$j^\mu = \rho\frac{dx^\mu}{dt} \qquad (\mu = 0, 1, 2, 3) \tag{60.8}$$

The continuity equation following from charge conservation is
[see (51.1)]

$$\nabla\mathbf{j} + \frac{\partial\rho}{\partial t} = 0$$

With a view to (60.7), this equation can be written in the
four-dimensional form:

$$\sum_{\mu=0}^{3}\frac{\partial j^\mu}{\partial x^\mu} = 0 \tag{60.9}$$

The left-hand side of relation (60.9) is the four-dimensional
divergence of the current four-vector. Indeed, by analogy with the
three-dimensional divergence, the four-divergence of the vector a^μ must
be determined as the scalar product of the vectors ∇* and a^μ, i.e. as

$$\sum_{\mu=0}^{3}\nabla_\mu a^\mu = \sum_{\mu=0}^{3}\frac{\partial a^\mu}{\partial x^\mu}$$

[see expression (XII.42) for the covariant components of the
vector ∇*].

That the four-divergence of the vector j^μ equals zero is an
analytical expression of the law of charge conservation. The fact that the
vector (60.7) which we have introduced satisfies such a simple
condition is another argument in favour of combining ρ and j into a
single four-vector.
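
The four-vector character of (60.7) can be illustrated numerically.
The sketch below assumes the standard Lorentz boost along the x axis
for the contravariant components of a four-vector (the explicit form
used here is an assumption of the example and is not quoted from
Chap. VII), applies it to the current four-vector of a charge
distribution at rest, and checks that the result is consistent with
dV' = dV √(1 − v²/c²) and with the invariance of
(j⁰)² − (j¹)² − (j²)² − (j³)².

```python
import math

C = 2.99792458e10  # speed of light, cm/s (Gaussian units)


def boost_x(four_vec, beta):
    """Components of a contravariant four-vector in a frame moving along +x
    with velocity beta * c (standard Lorentz boost, assumed form)."""
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    a0, a1, a2, a3 = four_vec
    return (gamma * (a0 - beta * a1), gamma * (a1 - beta * a0), a2, a3)


def minkowski_square(four_vec):
    """Invariant square (a0)^2 - (a1)^2 - (a2)^2 - (a3)^2."""
    a0, a1, a2, a3 = four_vec
    return a0 * a0 - a1 * a1 - a2 * a2 - a3 * a3


if __name__ == "__main__":
    rho_rest = 5.0                        # charge density in the rest frame (sample value)
    j_rest = (C * rho_rest, 0.0, 0.0, 0.0)
    beta = 0.6
    j_moving = boost_x(j_rest, beta)
    print(j_moving[0] / C)                # charge density in the new frame
    print(j_moving[1])                    # x-component of the current density
    print(minkowski_square(j_rest), minkowski_square(j_moving))
```

In the new frame the charge density grows by the factor
1/√(1 − v²/c²), which compensates the contraction of the volume, and
the charges are seen to move with the velocity −v, so that
j¹ = −ρ'v.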

Having in view that □ = −∇*², let us write Eqs. (57.6) and
(57.7) as follows:

$$\nabla^{*2}\mathbf{A} = \frac{4\pi}{c}\mathbf{j} \tag{60.10}$$

$$\nabla^{*2}\varphi = \frac{4\pi}{c}(c\rho) \tag{60.11}$$

Examination of Eq. (60.10) shows that the quantities A^k behave in
the same way as the quantities j^k, i.e. like space components of a
four-vector. This circumstance allows us to combine φ and A into a
single four-vector:

$$A^\mu = (\varphi,\ \mathbf{A}) \tag{60.12}$$

known as the four-potential of an electromagnetic field. Equations
(60.10) and (60.11) can therefore be written as a single equation:

$$\nabla^{*2}A^\mu = \frac{4\pi}{c}j^\mu \tag{60.13}$$

or

$$\Box A^\mu = -\frac{4\pi}{c}j^\mu \qquad (\mu = 0, 1, 2, 3) \tag{60.14}$$

The covariant components of the four-potential are as follows:

$$A_\mu = (\varphi,\ -\mathbf{A}) \tag{60.15}$$
We remind our reader that the four-potential is determined
non-uniquely. By (47.6), the values of the space components A_k can be
replaced with the quantities

$$A'_k = A_k + \frac{\partial\psi(x^1, x^2, x^3)}{\partial x^k} \qquad (k = 1, 2, 3) \tag{60.16}$$

and the value of the component A_0 with the quantity

$$A'_0 = A_0 + C \tag{60.17}$$

[C is a constant; see the text following formula (41.5)] without
changing the field characteristics B and E.

The Lorentz gauge condition [see (56.8)] in the four-dimensional
form is

$$\sum_{\mu=0}^{3}\frac{\partial A^\mu}{\partial x^\mu} = 0 \tag{60.18}$$

This signifies that the four-potential is chosen so that its four-diver- 
gence vanishes [compare with (47.7)]. 
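
As a short check, the four-divergence can be written out with
x⁰ = ct and A^μ = (φ, A) from (60.12); the sum then reproduces the
vacuum form (56.9) of the Lorentz condition:

```latex
\sum_{\mu=0}^{3}\frac{\partial A^\mu}{\partial x^\mu}
  = \frac{\partial A^0}{\partial x^0}
    + \sum_{k=1}^{3}\frac{\partial A^k}{\partial x^k}
  = \frac{1}{c}\frac{\partial\varphi}{\partial t} + \nabla\mathbf{A} = 0
```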


61. Electromagnetic Field Tensor 

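Let us go over from potentials to the force characteristics E and B
of a field. This transition is performed by formulas (56.1) and (56.3).
For convenience, we shall repeat these formulas:

$$\mathbf{B} = [\nabla\mathbf{A}] \tag{61.1}$$

$$\mathbf{E} = -\nabla\varphi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} \tag{61.2}$$

Let us write the expressions for the components of the vector E:

$$\begin{aligned}
E_x &= -\frac{\partial\varphi}{\partial x} - \frac{1}{c}\frac{\partial A_x}{\partial t} \\
E_y &= -\frac{\partial\varphi}{\partial y} - \frac{1}{c}\frac{\partial A_y}{\partial t} \\
E_z &= -\frac{\partial\varphi}{\partial z} - \frac{1}{c}\frac{\partial A_z}{\partial t}
\end{aligned}$$

In four-dimensional symbols, these formulas can be written as

$$\begin{aligned}
E_x &= \frac{\partial A_1}{\partial x^0} - \frac{\partial A_0}{\partial x^1} = \nabla_0 A_1 - \nabla_1 A_0 \\
E_y &= \frac{\partial A_2}{\partial x^0} - \frac{\partial A_0}{\partial x^2} = \nabla_0 A_2 - \nabla_2 A_0 \\
E_z &= \frac{\partial A_3}{\partial x^0} - \frac{\partial A_0}{\partial x^3} = \nabla_0 A_3 - \nabla_3 A_0
\end{aligned} \tag{61.3}$$

[see formulas (XII.42) for the covariant components of the gradient
four-operator and formulas (60.15) for the covariant components of
the four-potential. By the last formulas, for instance, A_x = −A_1,
etc.].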

Now let us write the expressions for the components of the vector 
B. According to (61.1) 


B x 

By 

B z 


3A Z 

dAy _ 

_ 3A 2 

M, 

dy 

dz 

dx 3 

dx 2 

8A X 

dA z 

_ 3A a 

8A X 

dz 

dx 

dx 1 

dx 3 

dAy 

! 

H 

1 

dA 1 

CJ 

1 

dx 

dy 

dx 2 

dx 1 


V3A2— - vf-'ds 
= V?A 3 -vMi 
--VtAi-VtA 2 


(61.4) 


It is known from tensor algebra that expressions of the kind a^bv — 
— a v b^ are covariant components of an antisymmetric second-rank 
tensor ( a ^ and b v are covariant components of arbitrary vectors). 
It follow^ from formulas (61.3) and (61.4) that the components of 
the vectors E and B can be interpreted as covariant components of 
the antisymmetric four-tensor 

F „ v = A v - V54* = (61.5) 


The latter is called the electromagnetic field tensor. 

Raising the indices μ and ν on both sides of (61.5) and lowering
them on the right side, we get an expression for the contravariant
components of the electromagnetic field tensor:

$$F^{\mu\nu} = \frac{\partial A^\nu}{\partial x_\mu} - \frac{\partial A^\mu}{\partial x_\nu} \tag{61.6}$$

A comparison of formulas (61.3) and (61.4) with expression (61.5)
shows that

$$\begin{aligned}
E_x &= F_{01} = -F_{10}, & E_y &= F_{02} = -F_{20}, & E_z &= F_{03} = -F_{30} \\
B_x &= F_{32} = -F_{23}, & B_y &= F_{13} = -F_{31}, & B_z &= F_{21} = -F_{12}
\end{aligned} \tag{61.7}$$

The tensor F can thus be written as follows:

$$(F_{\mu\nu}) = \begin{pmatrix}
0 & E_x & E_y & E_z \\
-E_x & 0 & -B_z & B_y \\
-E_y & B_z & 0 & -B_x \\
-E_z & -B_y & B_x & 0
\end{pmatrix} \tag{61.8}$$


Taking into account formulas (XII. 58), let us write the values of
the contravariant components of the electromagnetic field tensor:

$$(F^{\mu\nu}) = \begin{pmatrix}
0 & -E_x & -E_y & -E_z \\
E_x & 0 & -B_z & B_y \\
E_y & B_z & 0 & -B_x \\
E_z & -B_y & B_x & 0
\end{pmatrix} \tag{61.9}$$


Hence, in four-dimensional space, an electromagnetic field is 
described with the aid of one antisymmetric second-rank tensor in- 
stead of two vectors (E and B). 
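
As a quick numerical illustration of the layouts (61.8) and (61.9), the following sketch builds the covariant components from sample values of E and B and raises both indices with the metric. It assumes the signature (+, -, -, -) and the index order (ct, x, y, z); the function names and the sample field values are illustrative and are not taken from the text.

```python
# Sketch: covariant and contravariant components of the field tensor.
# Assumptions: metric signature (+, -, -, -), Gaussian units,
# index order (0, 1, 2, 3) = (ct, x, y, z). All names are illustrative.
METRIC_DIAG = (1.0, -1.0, -1.0, -1.0)


def field_tensor_cov(E, B):
    """Covariant components F_{mu nu}, laid out as in (61.8)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, Ex, Ey, Ez],
        [-Ex, 0.0, -Bz, By],
        [-Ey, Bz, 0.0, -Bx],
        [-Ez, -By, Bx, 0.0],
    ]


def raise_both_indices(F_cov):
    """F^{mu nu} = g^{mu mu} g^{nu nu} F_{mu nu} for a diagonal metric."""
    return [[METRIC_DIAG[m] * METRIC_DIAG[n] * F_cov[m][n] for n in range(4)]
            for m in range(4)]


E = (1.0, 2.0, 3.0)  # sample values only
B = (4.0, 5.0, 6.0)
F_cov = field_tensor_cov(E, B)
F_con = raise_both_indices(F_cov)
```

Raising the indices changes the sign of the components with one time index (the E entries) and leaves the purely spatial components (the B entries) unchanged, which is the difference between (61.8) and (61.9).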

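We must note that whereas the four-potential $A_\mu$ is determined
non-uniquely, the components of the tensor F are unique. Indeed,
let us replace the components $A_\mu$ in (61.5) with the quantities
$A'_\mu$ according to formulas (60.16) and (60.17). If μ and ν equal
1, 2, and 3, we obtain

$$F'_{\mu\nu} = \frac{\partial A'_\nu}{\partial x^\mu} - \frac{\partial A'_\mu}{\partial x^\nu}
= \frac{\partial A_\nu}{\partial x^\mu} + \frac{\partial^2 \psi(x^1, x^2, x^3)}{\partial x^\mu\,\partial x^\nu}
- \frac{\partial A_\mu}{\partial x^\nu} - \frac{\partial^2 \psi(x^1, x^2, x^3)}{\partial x^\nu\,\partial x^\mu}
= \frac{\partial A_\nu}{\partial x^\mu} - \frac{\partial A_\mu}{\partial x^\nu} = F_{\mu\nu}$$

If one of the indices (for instance μ) is zero, we get the relation

$$F'_{0\nu} = \frac{\partial A'_\nu}{\partial x^0} - \frac{\partial A'_0}{\partial x^\nu}
= \frac{\partial A_\nu}{\partial x^0} + \frac{\partial}{\partial x^0}\left(\frac{\partial \psi(x^1, x^2, x^3)}{\partial x^\nu}\right)
- \frac{\partial A_0}{\partial x^\nu} - \frac{\partial C}{\partial x^\nu}
= \frac{\partial A_\nu}{\partial x^0} - \frac{\partial A_0}{\partial x^\nu} = F_{0\nu}$$

(the function ψ does not depend on $x^0$, and C is a constant).

We have thus proved that the quantities $F_{\mu\nu}$ are determined
uniquely.


62. Field Transformation Formulas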


Formulas for transforming the components of an antisymmetric 
four-tensor are established in Appendix XII [see formulas (XII. 62)]. 
Substitution of the values (35.8) and (35.9) for a 0 and a x results in 
these formulas becoming 


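$$\begin{aligned}
A'^{01} &= A^{01}, &
A'^{02} &= \frac{A^{02} - \beta A^{12}}{\sqrt{1-\beta^2}}, &
A'^{03} &= \frac{A^{03} - \beta A^{13}}{\sqrt{1-\beta^2}} \\
A'^{12} &= \frac{A^{12} - \beta A^{02}}{\sqrt{1-\beta^2}}, &
A'^{13} &= \frac{A^{13} - \beta A^{03}}{\sqrt{1-\beta^2}}, &
A'^{23} &= A^{23}
\end{aligned} \tag{62.1}$$

(here $\beta = v_0/c$).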

Writing formulas (62.1) for the tensor (61.9), we obtain

$$F'^{01} = F^{01}, \quad \text{i.e.} \quad E'_x = E_x$$

$$F'^{02} = \frac{F^{02} - \beta F^{12}}{\sqrt{1-\beta^2}}, \quad \text{i.e.} \quad E'_y = \frac{E_y - \beta B_z}{\sqrt{1-\beta^2}}$$

etc. Writing out relations (62.1) for all the components of the tensor
$F^{\mu\nu}$ and replacing $F^{\mu\nu}$ with the relevant values of $E_k$
and $B_k$, we arrive at formulas for transforming the components of the
vectors E and B in a transition from one inertial reference frame to
another:


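$$\begin{aligned}
E'_x &= E_x, & E'_y &= \frac{E_y - \beta B_z}{\sqrt{1-\beta^2}}, & E'_z &= \frac{E_z + \beta B_y}{\sqrt{1-\beta^2}} \\
B'_x &= B_x, & B'_y &= \frac{B_y + \beta E_z}{\sqrt{1-\beta^2}}, & B'_z &= \frac{B_z - \beta E_y}{\sqrt{1-\beta^2}}
\end{aligned} \tag{62.2}$$

The formulas for the inverse transformation differ from these
formulas only in the sign of the terms containing the factor β (i.e. in
the sign of $v_0$).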
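
The agreement between the tensor rule (62.1) and the component formulas (62.2) can be checked numerically. The sketch below is a minimal illustration: it assumes a boost along the x-axis with the usual Lorentz matrix, the tensor layout (61.9), and sample field values; all function names are hypothetical.

```python
import math


def boost_fields(E, B, beta):
    """Formulas (62.2): frame K' moves along x with speed beta*c relative to K."""
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    E_new = (Ex, g * (Ey - beta * Bz), g * (Ez + beta * By))
    B_new = (Bx, g * (By + beta * Ez), g * (Bz - beta * Ey))
    return E_new, B_new


def contravariant_tensor(E, B):
    """Contravariant components F^{mu nu}, laid out as in (61.9)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, -Bz, By],
        [Ey, Bz, 0.0, -Bx],
        [Ez, -By, Bx, 0.0],
    ]


def boost_tensor(F_con, beta):
    """General rule F'^{mu nu} = sum_{a,b} L^mu_a L^nu_b F^{ab} for the same boost."""
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    L = [
        [g, -g * beta, 0.0, 0.0],
        [-g * beta, g, 0.0, 0.0],
        [0.0, 0.0, 1.0, 0.0],
        [0.0, 0.0, 0.0, 1.0],
    ]
    return [[sum(L[m][a] * L[n][b] * F_con[a][b]
                 for a in range(4) for b in range(4))
             for n in range(4)] for m in range(4)]


E, B, beta = (1.0, 2.0, 3.0), (4.0, 5.0, 6.0), 0.6  # sample values only
E1, B1 = boost_fields(E, B, beta)
F1 = boost_tensor(contravariant_tensor(E, B), beta)

# Read E' and B' back out of the transformed tensor using the layout (61.9).
E_from_tensor = (-F1[0][1], -F1[0][2], -F1[0][3])
B_from_tensor = (-F1[2][3], F1[1][3], -F1[1][2])

# The inverse transformation is the same set of formulas with beta -> -beta.
E_back, B_back = boost_fields(E1, B1, -beta)
```

Both routes give the same primed fields, and applying the formulas again with the opposite sign of β restores the original E and B.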

Resolving the vectors E and B into components parallel to the
x-axis (and, consequently, to the vector $\mathbf{v}_0$) and components
perpendicular to this axis (i.e. writing, for example, E as
$\mathbf{E}_\parallel + \mathbf{E}_\perp$), we can write formulas (62.2)
in the vector form 1 :

$$\begin{aligned}
\mathbf{E}'_\parallel &= \mathbf{E}_\parallel, &
\mathbf{E}'_\perp &= \frac{\mathbf{E}_\perp + (1/c)\,[\mathbf{v}_0 \mathbf{B}_\perp]}{\sqrt{1 - v_0^2/c^2}} \\
\mathbf{B}'_\parallel &= \mathbf{B}_\parallel, &
\mathbf{B}'_\perp &= \frac{\mathbf{B}_\perp - (1/c)\,[\mathbf{v}_0 \mathbf{E}_\perp]}{\sqrt{1 - v_0^2/c^2}}
\end{aligned} \tag{62.3}$$

1 We must note that since $\mathbf{B}_\parallel$ and $\mathbf{v}_0$ are
collinear, $[\mathbf{v}_0\mathbf{B}] = [\mathbf{v}_0\mathbf{B}_\parallel] + [\mathbf{v}_0\mathbf{B}_\perp] = [\mathbf{v}_0\mathbf{B}_\perp]$.
Similarly, $[\mathbf{v}_0\mathbf{E}] = [\mathbf{v}_0\mathbf{E}_\perp]$. For
this reason, in the vector products of formulas (62.3), the subscript
"⊥" at $\mathbf{B}_\perp$ and $\mathbf{E}_\perp$ may be discarded.

(recall that $v_x = v_0$, and $v_y = v_z = 0$). A glance at these formulas
shows that the longitudinal 1 components of the fields in a transition
from one reference frame to another do not change, and only the
lateral components are transformed.

If $\beta \ll 1$ (i.e. $v_0 \ll c$), the expression $1/\sqrt{1-\beta^2}$
approximately equals $1 + \tfrac{1}{2}\beta^2$. Consequently, within terms
of the order of $\beta = v_0/c$, formulas (62.2) can be written as follows:

$$\begin{aligned}
E'_x &= E_x, & E'_y &= E_y - (v_0/c)\,B_z, & E'_z &= E_z + (v_0/c)\,B_y \\
B'_x &= B_x, & B'_y &= B_y + (v_0/c)\,E_z, & B'_z &= B_z - (v_0/c)\,E_y
\end{aligned} \tag{62.4}$$

It is not difficult to see that these formulas can be written in the
vector form:

$$\mathbf{E}' = \mathbf{E} + \frac{1}{c}[\mathbf{v}_0\mathbf{B}], \quad
\mathbf{B}' = \mathbf{B} - \frac{1}{c}[\mathbf{v}_0\mathbf{E}] \tag{62.5}$$

[compare with formulas (62.3)].

If there is only an electric field in the frame K (i.e. $\mathbf{E} \ne 0$
and $\mathbf{B} = 0$), both fields exist in the frame K'. By formulas
(62.5), these fields are

$$\mathbf{E}' = \mathbf{E}, \quad \mathbf{B}' = -\frac{1}{c}[\mathbf{v}_0\mathbf{E}]$$

Having in view that $\mathbf{E} = \mathbf{E}'$, we can write that

$$\mathbf{B}' = -\frac{1}{c}[\mathbf{v}_0\mathbf{E}'] \tag{62.6}$$

Relation (62.6) 2 indicates that the fields B' and E' are mutually
perpendicular. It can also be seen from this relation that the field
B' is perpendicular to the vector $\mathbf{v}_0$, i.e. to the x-axis.

It can be shown similarly that when there is only a magnetic
field in the frame K, the vectors B' and E' are related by the
expression

$$\mathbf{E}' = \frac{1}{c}[\mathbf{v}_0\mathbf{B}'] \tag{62.7}$$

Consequently, in this case too the fields B' and E' are mutually
perpendicular. In addition, E' is perpendicular to $\mathbf{v}_0$, i.e.
to the x-axis.

Hence, if only one of the fields (E or B) exists in the frame K,
in any other frame K' the fields B' and E' are mutually perpendicular.
The opposite conclusion also holds: if the fields B and E
are mutually perpendicular in a frame K (and the magnitudes of
the fields satisfy the conditions indicated below), frames K' exist
in which the field is purely electric, and also frames in which the
field is purely magnetic. Let us find the velocities of the relevant
systems beginning with a treatment of the case when $v_0 \ll c$.

1 For simplicity, we shall use this name for the components parallel to the
vector of the relative velocity of the frames K and K' (to the vector
$\mathbf{v}_0$). We shall call the perpendicular components lateral ones.

2 We invite our reader to convince himself that the same relation between
the vectors B' and E' can also be obtained in the case being considered
from the accurate formulas (62.2).

Assume that the vectors B and E are mutually perpendicular.
It follows from formulas (62.5) that for the field in the frame K'
to be purely electric (i.e. for B' to vanish), the vector $\mathbf{v}_0$
must satisfy the condition

$$\mathbf{B} = \frac{1}{c}[\mathbf{v}_0\mathbf{E}]$$

This condition will be observed if the vector $\mathbf{v}_0$ is
perpendicular to B (the vector E is perpendicular to B according to our
assumption) and, in addition, $v_0 E \sin\alpha = cB$, where α is the
angle between the vectors $\mathbf{v}_0$ and E. Hence, the field is purely
electric in all frames travelling in directions perpendicular to B
provided that the velocity $v_0$ of the given frame is
$cB/(E\sin\alpha)$. Since the velocity $v_0$ of the system cannot exceed c,
the reference frames being considered exist only provided that
$cB/(E\sin\alpha) < c$, i.e. $B < E\sin\alpha$. If $E \le B$, this
condition is not observed at any angle α. Therefore, in this case,
notwithstanding the mutual perpendicularity of B and E, no frames
exist in which the field is purely electric.

It is not difficult to see that the result we have obtained is true
without stipulating that $v_0 \ll c$. For this purpose, let us turn to
formulas (62.3). Assume that B and E are mutually perpendicular.
Let us take the frame K' whose velocity $\mathbf{v}_0$ is perpendicular
to B and equal in magnitude to $cB/(E\sin\alpha)$ (here α is the angle
between the vectors $\mathbf{v}_0$ and E). Since the vectors $\mathbf{v}_0$
and B are mutually perpendicular, the component $\mathbf{B}_\parallel$
vanishes. By formulas (62.3), $\mathbf{B}'_\parallel$ also vanishes. Let us
consider the numerator of formula (62.3) for $\mathbf{B}'_\perp$. With the
direction of $\mathbf{v}_0$ we have chosen, the vector $\mathbf{B}_\perp$
equals B. The vector product of the vectors $\mathbf{v}_0$ and E can be
written as

$$[\mathbf{v}_0\mathbf{E}] = [\mathbf{v}_0\mathbf{E}_\parallel] + [\mathbf{v}_0\mathbf{E}_\perp] = [\mathbf{v}_0\mathbf{E}_\perp]$$

(the first term vanishes because the vectors $\mathbf{v}_0$ and
$\mathbf{E}_\parallel$ are collinear). Consequently, the formula for
$\mathbf{B}'_\perp$ for the case being considered can be written as follows:

$$\mathbf{B}'_\perp = \frac{\mathbf{B} - (1/c)\,[\mathbf{v}_0\mathbf{E}]}{\sqrt{1 - v_0^2/c^2}} \tag{62.8}$$

According to our condition, B is perpendicular to both E and
$\mathbf{v}_0$. Therefore, the vector $[\mathbf{v}_0\mathbf{E}]$ is collinear
to the vector B. By properly choosing the direction of the vector
$\mathbf{v}_0$ (to the right or the left), we can make the vectors
$[\mathbf{v}_0\mathbf{E}]$ and B have the same direction. The numerator of
formula (62.8) thus contains the difference of two identically directed
vectors whose magnitudes are B and $(1/c)\,v_0 E\sin\alpha$. If
$v_0 = cB/(E\sin\alpha)$, the magnitudes of these vectors will be the same,
and the numerator in the formula for $\mathbf{B}'_\perp$ will vanish.
Hence, both $\mathbf{B}'_\parallel$ and $\mathbf{B}'_\perp$ are absent in
this case. We can show in a similar way that the field is purely magnetic
with mutual perpendicularity of the fields B and E in the frames K'
travelling in directions perpendicular to the vector E at a speed $v_0$
equal to $cE/(B\sin\alpha)$ (here α is the angle between the vectors
$\mathbf{v}_0$ and B). This statement holds provided that
$E < B\sin\alpha$. If $B \le E$, no systems exist in which the field is
purely magnetic.
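
A small numerical sketch of the purely electric case follows. It assumes the simplest geometry, with E along y, B along z and the frame velocity along x (so that α = 90°), and uses the exact formulas (62.2); in Gaussian units the requirement $v_0 < c$ with $v_0 = cB/E$ amounts to $B < E$. The function name and sample magnitudes are illustrative.

```python
import math


def purely_electric_frame(E_mag, B_mag):
    """Return (beta, E'_y, B'_z) for the frame in which B' vanishes.

    Assumed geometry: E along y, B along z, frame velocity along x,
    so alpha = 90 degrees and v_0 = c*B/E, which requires B < E.
    """
    if not B_mag < E_mag:
        raise ValueError("no purely electric frame exists unless B < E")
    beta = B_mag / E_mag
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    E_y_new = g * (E_mag - beta * B_mag)  # (62.2) with E_y = E, B_z = B
    B_z_new = g * (B_mag - beta * E_mag)  # vanishes by construction
    return beta, E_y_new, B_z_new


beta, E_new, B_new = purely_electric_frame(5.0, 3.0)
```

For E = 5 and B = 3 the sketch gives β = 0.6, a vanishing magnetic field and E' = 4, which equals $\sqrt{E^2 - B^2}$.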


63. Field Invariants


Let us form the expression

$$B'^2 - E'^2 = \sum_k B_k'^2 - \sum_k E_k'^2 = \sum_k \left(B_k'^2 - E_k'^2\right)$$

and substitute for the quantities $B'_k$ and $E'_k$ in it their expressions
in terms of the unprimed components [see (62.2)]:

$$B'^2 - E'^2 = B_x^2 - E_x^2 + \frac{(B_y + \beta E_z)^2 - (E_y - \beta B_z)^2 + (B_z - \beta E_y)^2 - (E_z + \beta B_y)^2}{1 - \beta^2}$$

It is a simple matter to see that the right-hand side is reduced to
the form

$$(B_x^2 - E_x^2) + (B_y^2 - E_y^2) + (B_z^2 - E_z^2)$$

We thus arrive at the conclusion that the difference between the
squares of the vectors B and E has the same value in all inertial
reference frames, i.e. is an invariant:

$$B^2 - E^2 = \text{inv} \tag{63.1}$$

Now let us form the scalar product of the vectors E' and B', i.e. the
sum $\sum_k E'_k B'_k$. Introducing into this sum instead of $E'_k$ and
$B'_k$ their values from (62.2), we obtain

$$\sum_k E'_k B'_k = E_x B_x + \frac{(E_y - \beta B_z)(B_y + \beta E_z) + (E_z + \beta B_y)(B_z - \beta E_y)}{1 - \beta^2} = \sum_k E_k B_k$$

Consequently, the scalar product of the vectors E and B is also
an invariant:

$$\mathbf{E}\mathbf{B} = \text{inv} \tag{63.2}$$


Inspection of (63.2) shows that when the fields B and E are mutually
perpendicular (i.e. EB = 0) in a certain reference frame, they
are mutually perpendicular in any other inertial reference frame.

Inspection of (63.1) shows that when the magnitudes of the vectors
B and E are the same (i.e. $B^2 - E^2 = 0$) in a certain reference
frame, they are the same in any other inertial reference frame.
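
The invariance of (63.1) and (63.2) is easy to confirm numerically. The sketch below applies the transformation formulas (62.2) for several values of β to sample fields and recomputes both quantities; the function names and field values are illustrative assumptions.

```python
import math


def boost_fields(E, B, beta):
    """Formulas (62.2) for a frame K' moving along x with speed beta*c."""
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return ((Ex, g * (Ey - beta * Bz), g * (Ez + beta * By)),
            (Bx, g * (By + beta * Ez), g * (Bz - beta * Ey)))


def invariants(E, B):
    """Return (B^2 - E^2, E.B), i.e. the quantities (63.1) and (63.2)."""
    e2 = sum(x * x for x in E)
    b2 = sum(x * x for x in B)
    return b2 - e2, sum(e * b for e, b in zip(E, B))


E, B = (1.0, 2.0, 3.0), (4.0, 5.0, 6.0)  # sample values only
before = invariants(E, B)
after = [invariants(*boost_fields(E, B, beta)) for beta in (0.1, 0.5, 0.9)]
```

For these sample fields both quantities keep the values 63 and 32 in every boosted frame, up to rounding error.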

In addition, the following conclusions can be drawn from the
invariance of expressions (63.1) and (63.2). When the vectors B
and E form an acute (or obtuse) angle (i.e. EB is greater or less
than zero) in a certain reference frame, they form an acute (or obtuse)
angle in any other frame. If $B > E$ (or $B < E$) (i.e. $B^2 - E^2$ is
greater or less than zero) in a certain frame, the same relation will
be retained between the magnitudes of the vectors B and E in any
other frame.

When both invariants equal zero, the vectors B and E in all 
inertial reference frames are mutually perpendicular and equal in 
magnitude. 

If only the invariant (63.2) equals zero, i.e. EB = 0, it is pos- 
sible to find a reference frame in which either the field B or E is 
zero depending on the value of the expression B 2 — E 2 . The opposite 
is also true: if one of the fields B or E is zero in a certain frame, the 
fields in any other frame will be mutually perpendicular (we already 
arrived at this conclusion earlier when analysing the formulas for 
field transformation). 

One must have in view that the fields B and E, generally speaking, 
vary from point to point. Therefore, the invariants (63.1) and (63.2) 
may have different values at different points. The above statements 
on the properties of fields relate to their points for which the assump- 
tions we have adopted are observed (for instance, the equality to 
zero of a given invariant, etc.). If these assumptions are observed 
at all the points of a field, the statements on the properties of fields 
will also naturally relate to all the points. 

With a view to the above remarks, let us assume that at a point
of a field in the frame K, the product EB is non-zero, i.e. that the
fields at the given point are not perpendicular to each other. We can
now find such a reference frame K' in which the fields at the given
point are parallel to each other. In this frame,
$\mathbf{E}'\mathbf{B}' = E'B'$, so that we obtain two equations:

$$B'^2 - E'^2 = B^2 - E^2, \quad E'B' = \mathbf{E}\mathbf{B}$$

Solving these equations simultaneously, we find the values of the
quantities E' and B' in the reference frame in which the fields E'
and B' are parallel (the vectors E and B are set).
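
The two equations can be solved in closed form. Writing $p = B^2 - E^2$ and $q = \mathbf{E}\mathbf{B}$, one finds $B'^2 = \tfrac{1}{2}(\sqrt{p^2 + 4q^2} + p)$ and $E'^2 = \tfrac{1}{2}(\sqrt{p^2 + 4q^2} - p)$. The sketch below evaluates these expressions; the function name and the sample vectors are illustrative, and it assumes $q > 0$ as in the text (for $q < 0$ the same magnitudes are obtained with antiparallel fields).

```python
import math


def parallel_frame_magnitudes(E, B):
    """Magnitudes (E', B') in the frame where the fields are parallel.

    Solves B'^2 - E'^2 = p and E'B' = q with p = B^2 - E^2, q = E.B.
    """
    p = sum(b * b for b in B) - sum(e * e for e in E)
    q = sum(e * b for e, b in zip(E, B))
    s = math.hypot(p, 2.0 * q)  # sqrt(p^2 + 4 q^2)
    return math.sqrt((s - p) / 2.0), math.sqrt((s + p) / 2.0)


E, B = (1.0, 1.0, 0.0), (2.0, 0.0, 1.0)  # sample values: p = 3, q = 2
E_par, B_par = parallel_frame_magnitudes(E, B)
```

For these sample vectors E' = 1 and B' = 2, which reproduce both invariants: $B'^2 - E'^2 = 3$ and $E'B' = 2$.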

The invariants of a field can be found proceeding from the general
properties of tensors. It is shown in Appendix X that the multiplication
of tensors of ranks m and n yields a tensor of rank (m + n),
and also that the contraction of a tensor over any pair of indices
lowers the rank of the tensor by two. Particularly, the contraction
of a second-rank tensor, equal to the sum of its diagonal components,
is called the trace of the tensor and is an invariant [see formulas
(X.21) and (XII.63)].

For an antisymmetric tensor like the electromagnetic field tensor, 
the trace is zero so that this invariant is of no interest. 

Let us form the product of the tensors (61.8) and (61.9), i.e. a tensor
with the components $F_{\mu\nu}F^{\rho\sigma}$. It is a tensor of the
fourth rank. We shall perform a double contraction of this tensor,
assuming the indices μ and ρ, and also ν and σ, to be equal and
summating over these indices. As a result, the rank of the tensor will
lower by four (each of the two contractions lowers the rank by two), and
we obtain a zero-rank tensor, i.e. an invariant:

$$\sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu} = \text{inv}$$

Introduction of the values of $F_{\mu\nu}$ and $F^{\mu\nu}$ [see (61.8)
and (61.9)] yields

$$\sum_{\mu,\nu=0}^{3} F_{\mu\nu}F^{\mu\nu} = -2\sum_{k=1}^{3} E_k^2 + 2\sum_{k=1}^{3} B_k^2 = 2\,(B^2 - E^2) = \text{inv} \tag{63.3}$$

which agrees with formula (63.1).
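
The double contraction can be evaluated directly from the component tables. The sketch below assumes the layouts (61.8) and (61.9) and sample field values; the helper names are illustrative.

```python
def field_tensor_cov(E, B):
    """Covariant components F_{mu nu}, laid out as in (61.8)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, Ex, Ey, Ez],
        [-Ex, 0.0, -Bz, By],
        [-Ey, Bz, 0.0, -Bx],
        [-Ez, -By, Bx, 0.0],
    ]


def field_tensor_con(E, B):
    """Contravariant components F^{mu nu}, laid out as in (61.9)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, -Bz, By],
        [Ey, Bz, 0.0, -Bx],
        [Ez, -By, Bx, 0.0],
    ]


E, B = (1.0, 2.0, 3.0), (4.0, 5.0, 6.0)  # sample values only
F_cov, F_con = field_tensor_cov(E, B), field_tensor_con(E, B)

# Double contraction: sum over mu and nu of F_{mu nu} F^{mu nu}.
contraction = sum(F_cov[m][n] * F_con[m][n] for m in range(4) for n in range(4))
expected = 2.0 * (sum(b * b for b in B) - sum(e * e for e in E))
```

With $E^2 = 14$ and $B^2 = 77$ both `contraction` and `expected` equal 126, as (63.3) requires.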

Now let us form a four-tensor of the eighth rank

$$e^{\mu\nu\rho\sigma} F_{\alpha\beta} F_{\gamma\delta} \tag{63.4}$$

where $e^{\mu\nu\rho\sigma}$ is an absolutely antisymmetric unit
four-pseudotensor of the fourth rank [see Appendix XII, the text
following formula (XII. 68)]. The non-zero components of this tensor
equal +1 or -1 depending on whether an even or odd number of
transpositions is needed to obtain the given sequence of indices
μ, ν, ρ, σ from the sequence 0, 1, 2, 3. Table 63.1 gives all the
permutations of the indices with an indication of the sign of the
component corresponding to them.

TABLE 63.1

Sequence  Sign    Sequence  Sign    Sequence  Sign    Sequence  Sign
0123      +       1032      +       2013      +       3210      +
0132      -       1023      -       2031      -       3201      -
0231      +       1203      +       2130      +       3102      +
0213      -       1230      -       2103      -       3120      -
0312      +       1320      +       2301      +       3021      +
0321      -       1302      -       2310      -       3012      -

The four-fold contraction of the tensor (63.4) is the invariant

$$\sum_{\mu,\nu,\rho,\sigma} e^{\mu\nu\rho\sigma} F_{\mu\nu} F_{\rho\sigma} = \text{inv} \tag{63.5}$$

Having written 24 non-zero components of the sum (63.5) (the
signs of the components must be taken from Table 63.1), it is not
difficult to see that they can be divided into six groups each
including four coinciding expressions:

$$\sum = 4F_{01}F_{23} - 4F_{01}F_{32} + 4F_{02}F_{31} - 4F_{02}F_{13} + 4F_{03}F_{12} - 4F_{03}F_{21} \tag{63.6}$$

Owing to the antisymmetric nature of the tensor $F_{\mu\nu}$, the products
$F_{01}F_{32} = -F_{01}F_{23}$, etc. Expression (63.6) is therefore
simplified as follows:

$$\sum = 8\,(F_{01}F_{23} + F_{02}F_{31} + F_{03}F_{12}) = \text{inv}$$

The expression in parentheses is obviously also invariant. Substitution
of the values for F from (61.8) yields

$$E_x(-B_x) + E_y(-B_y) + E_z(-B_z) = \text{inv}$$

which coincides with (63.2).
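
The sum (63.5) can also be evaluated by brute force over all 24 permutations, with the sign of each permutation obtained by counting inversions. This is a minimal sketch assuming $e^{0123} = +1$ and the layout (61.8); the names and sample field values are illustrative.

```python
from itertools import permutations


def perm_sign(p):
    """+1 for an even permutation of (0, 1, 2, 3), -1 for an odd one."""
    inversions = sum(1 for i in range(len(p))
                     for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inversions % 2 else 1


def field_tensor_cov(E, B):
    """Covariant components F_{mu nu}, laid out as in (61.8)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, Ex, Ey, Ez],
        [-Ex, 0.0, -Bz, By],
        [-Ey, Bz, 0.0, -Bx],
        [-Ez, -By, Bx, 0.0],
    ]


E, B = (1.0, 2.0, 3.0), (4.0, 5.0, 6.0)  # sample values only
F = field_tensor_cov(E, B)

# Four-fold contraction (63.5) over all permutations of the indices.
total = sum(perm_sign(p) * F[p[0]][p[1]] * F[p[2]][p[3]]
            for p in permutations(range(4)))
dot = sum(e * b for e, b in zip(E, B))
```

The result equals $-8\,\mathbf{E}\mathbf{B}$, in agreement with the simplified form of (63.6); for the sample fields EB = 32 and the sum is -256.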


64. Maxwell's Equations in the Four-Dimensional Form 


The first pair of Maxwell’s equations [i.e. Eqs. (55.12)] can be
written as a single equation for the components of the tensor (61.8):

$$\frac{\partial F_{\mu\nu}}{\partial x^\rho} + \frac{\partial F_{\nu\rho}}{\partial x^\mu} + \frac{\partial F_{\rho\mu}}{\partial x^\nu} = 0 \tag{64.1}$$

We must note that the indices in each of the addends form a cyclic
transposition of the sequence μ, ν, ρ.

Expression (64.1) is a set of four equations, the first of which is
obtained at μ, ν, ρ equal to 0, 1, 2, respectively, the second at
μ, ν, ρ equal to 1, 2, 3, the third at μ, ν, ρ equal to 2, 3, 0,
and, finally, the fourth at μ, ν, ρ equal to 3, 0, 1. Owing to
antisymmetry of the tensor $F_{\mu\nu}$, the equation obtained at any other
combination of the three non-coinciding indices reduces to one of
the four indicated equations.

Let us write Eq. (64.1) assuming that μ = 0, ν = 1, and ρ = 2:

$$\frac{\partial F_{01}}{\partial x^2} + \frac{\partial F_{12}}{\partial x^0} + \frac{\partial F_{20}}{\partial x^1} = 0$$

Introducing the values of $F_{\mu\nu}$ and the coordinates $x^\mu$, we obtain

$$\frac{\partial E_x}{\partial y} - \frac{1}{c}\frac{\partial B_z}{\partial t} - \frac{\partial E_y}{\partial x} = 0$$

whence

$$\frac{\partial E_y}{\partial x} - \frac{\partial E_x}{\partial y} = -\frac{1}{c}\frac{\partial B_z}{\partial t}$$

The found relation is the z-th component of the vector equation
(55.12). Similarly, the equations for μ = 2, ν = 3, ρ = 0, and
μ = 3, ν = 0, ρ = 1 give the x-th and y-th components of the
same equation. (An equation is obtained for the component whose
index is absent in the set of values of μ, ν, ρ.)

Assuming that μ = 1, ν = 2, ρ = 3, we arrive at the formula

$$\frac{\partial F_{12}}{\partial x^3} + \frac{\partial F_{23}}{\partial x^1} + \frac{\partial F_{31}}{\partial x^2} = 0$$

which after introduction of the values of $F_{\mu\nu}$ and $x^\mu$ becomes

$$-\frac{\partial B_z}{\partial z} - \frac{\partial B_x}{\partial x} - \frac{\partial B_y}{\partial y} = 0$$

This is equivalent to the second of equations (55.10).

We have thus seen that Eq. (64.1) is equivalent to the first pair
of Maxwell’s equations.
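
Equation (64.1) holds identically for any tensor of the form (61.5), because second derivatives of the potential commute. The sketch below checks this for a hypothetical potential that is quadratic in the coordinates, $A_\nu = \tfrac{1}{2}\sum_{a,b} c_{\nu ab}\,x^a x^b$ with integer coefficients symmetric in a and b, for which $\partial F_{\mu\nu}/\partial x^\rho = c_{\nu\mu\rho} - c_{\mu\nu\rho}$ exactly. The coefficient formula is arbitrary and serves only as an illustration.

```python
def coeff(nu, a, b):
    """Hypothetical integer coefficients c[nu][a][b], symmetric in (a, b)."""
    lo, hi = min(a, b), max(a, b)
    return (nu + 1) * (3 * lo + 7 * hi + 1) + lo * hi


def dF(mu, nu, rho):
    """d F_{mu nu} / d x^rho for the quadratic potential described above.

    From (61.5), F_{mu nu} = sum_b (c[nu][mu][b] - c[mu][nu][b]) x^b,
    so its derivative with respect to x^rho is a constant.
    """
    return coeff(nu, mu, rho) - coeff(mu, nu, rho)


def cyclic_sum(mu, nu, rho):
    """Left-hand side of (64.1)."""
    return dF(mu, nu, rho) + dF(nu, rho, mu) + dF(rho, mu, nu)


# The four index triples listed in the text.
triples = [(0, 1, 2), (1, 2, 3), (2, 3, 0), (3, 0, 1)]
residuals = [cyclic_sum(*t) for t in triples]
```

All four residuals vanish although the individual derivatives do not (for example `dF(0, 1, 2)` is 10), which illustrates that the first pair of Maxwell's equations imposes no restriction on the potential.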

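The second pair of Maxwell’s equations, i.e. equations (55.13),
can be written as

$$\sum_{\nu=0}^{3} \frac{\partial F^{\mu\nu}}{\partial x^\nu} = -\frac{4\pi}{c}\, j^\mu \quad (\mu = 0, 1, 2, 3) \tag{64.2}$$

Indeed, let us assume, for instance, that μ = 1. Equation (64.2)
can therefore be written as follows:

$$\frac{\partial F^{10}}{\partial x^0} + \frac{\partial F^{12}}{\partial x^2} + \frac{\partial F^{13}}{\partial x^3} = -\frac{4\pi}{c}\, j^1$$

($F^{11} = 0$). Substituting their values for $F^{\mu\nu}$ and $x^\mu$, and
also grouping the terms as required, we get the equation

$$\frac{\partial B_z}{\partial y} - \frac{\partial B_y}{\partial z} = \frac{4\pi}{c}\, j_x + \frac{1}{c}\frac{\partial E_x}{\partial t}$$

which is the x-th component of the vector equation (55.13). Similarly,
the equations for μ = 2 and μ = 3 give the y-th and z-th components
of the same equation.

When μ = 0, Eq. (64.2) becomes

$$\frac{\partial F^{01}}{\partial x^1} + \frac{\partial F^{02}}{\partial x^2} + \frac{\partial F^{03}}{\partial x^3} = -4\pi\rho$$

Introduction of the values of $F^{\mu\nu}$ and $x^\mu$ yields

$$-\frac{\partial E_x}{\partial x} - \frac{\partial E_y}{\partial y} - \frac{\partial E_z}{\partial z} = -4\pi\rho$$

which coincides with the second of equations (55.13).

We have thus shown that Eq. (64.2) is equivalent to the second
pair of Maxwell’s equations.


65. Equation of Motion of a Particle in a Field

According to (38.16), the equation of motion of a charged particle
in an electromagnetic field is

$$\frac{d}{dt}\left(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) = e\mathbf{E} + \frac{e}{c}[\mathbf{v}\mathbf{B}] \tag{65.1}$$

Dividing both sides of (65.1) by $\sqrt{1 - v^2/c^2}$ and taking into
account that $dt\sqrt{1 - v^2/c^2}$ is $d\tau$ [see formula (34.13)], we obtain

$$\frac{d}{d\tau}\left(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) = \frac{e}{c}\,\frac{c\mathbf{E} + [\mathbf{v}\mathbf{B}]}{\sqrt{1 - v^2/c^2}}$$

or, in components,

$$\begin{aligned}
\frac{d}{d\tau}\left(\frac{m v_x}{\sqrt{1 - v^2/c^2}}\right) &= \frac{e}{c}\,\frac{cE_x + v_y B_z - v_z B_y}{\sqrt{1 - v^2/c^2}} \\
\frac{d}{d\tau}\left(\frac{m v_y}{\sqrt{1 - v^2/c^2}}\right) &= \frac{e}{c}\,\frac{cE_y + v_z B_x - v_x B_z}{\sqrt{1 - v^2/c^2}} \\
\frac{d}{d\tau}\left(\frac{m v_z}{\sqrt{1 - v^2/c^2}}\right) &= \frac{e}{c}\,\frac{cE_z + v_x B_y - v_y B_x}{\sqrt{1 - v^2/c^2}}
\end{aligned} \tag{65.2}$$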

But $v_k/\sqrt{1 - v^2/c^2} = u^k$, $c/\sqrt{1 - v^2/c^2} = u^0$ [see (36.6)].
Further, $E_x = F^{10}$, $E_y = F^{20}$, $E_z = F^{30}$,
$B_x = F^{32} = -F^{23}$, $B_y = F^{13} = -F^{31}$,
$B_z = F^{21} = -F^{12}$. Therefore, Eqs. (65.2) can be written
as follows:

$$\begin{aligned}
m\frac{du^1}{d\tau} &= \frac{e}{c}\left(F^{10}u_0 + F^{12}u_2 + F^{13}u_3\right) \\
m\frac{du^2}{d\tau} &= \frac{e}{c}\left(F^{20}u_0 + F^{21}u_1 + F^{23}u_3\right) \\
m\frac{du^3}{d\tau} &= \frac{e}{c}\left(F^{30}u_0 + F^{31}u_1 + F^{32}u_2\right)
\end{aligned} \tag{65.3}$$

On the right-hand side, we have used the covariant components
of the four-velocity so that all the terms will have a plus sign.

Now let us take advantage of the fact that the rate of change
in the energy of a particle equals the work done in unit time by
the forces acting on the particle:

$$\frac{d}{dt}\left(\frac{mc^2}{\sqrt{1 - v^2/c^2}}\right) = \left(e\mathbf{E} + \frac{e}{c}[\mathbf{v}\mathbf{B}]\right)\mathbf{v} = e\mathbf{E}\mathbf{v} \tag{65.4}$$

Dividing both sides of this equation by $c\sqrt{1 - v^2/c^2}$, we obtain

$$m\frac{du^0}{d\tau} = \frac{e}{c}\left(F^{01}u_1 + F^{02}u_2 + F^{03}u_3\right) \tag{65.5}$$

The set of equations (65.3) and (65.5) can be written as a single
equation

$$m\frac{du^\mu}{d\tau} = \frac{e}{c}\sum_{\nu=0}^{3} F^{\mu\nu}u_\nu \quad (\mu = 0, 1, 2, 3) \tag{65.6}$$

This is the equation of motion of a particle in a field written in the
four-dimensional form. Its space components are equivalent to
Eq. (65.1), and its time component to Eq. (65.4).
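
The equivalence can be illustrated numerically by evaluating the right-hand side of (65.6) and comparing it with the three-dimensional expressions. The sketch below assumes units with e = c = 1 by default and sample values of E, B and v; all names are illustrative.

```python
import math


def contravariant_tensor(E, B):
    """Contravariant components F^{mu nu}, laid out as in (61.9)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, -Bz, By],
        [Ey, Bz, 0.0, -Bx],
        [Ez, -By, Bx, 0.0],
    ]


def four_force(E, B, v, e=1.0, c=1.0):
    """Right-hand side of (65.6): (e/c) * sum_nu F^{mu nu} u_nu, plus gamma."""
    g = 1.0 / math.sqrt(1.0 - sum(x * x for x in v) / c**2)
    u_low = (g * c, -g * v[0], -g * v[1], -g * v[2])  # covariant four-velocity
    F = contravariant_tensor(E, B)
    rhs = [e / c * sum(F[m][n] * u_low[n] for n in range(4)) for m in range(4)]
    return rhs, g


def lorentz_force(E, B, v, e=1.0, c=1.0):
    """Right-hand side of (65.1): e E + (e/c) [v B]."""
    cross = (v[1] * B[2] - v[2] * B[1],
             v[2] * B[0] - v[0] * B[2],
             v[0] * B[1] - v[1] * B[0])
    return [e * (E[k] + cross[k] / c) for k in range(3)]


E, B, v = (1.0, 2.0, 3.0), (4.0, 5.0, 6.0), (0.1, 0.2, 0.3)  # sample values
rhs, g = four_force(E, B, v)
force = lorentz_force(E, B, v)
power = sum(E[k] * v[k] for k in range(3))  # e E.v with e = 1
```

The space components of `rhs` equal the Lorentz force multiplied by $1/\sqrt{1 - v^2/c^2}$, and the time component equals $e\mathbf{E}\mathbf{v}/c$ multiplied by the same factor, as the division steps leading to (65.3) and (65.5) require.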

Equation (65.6) can be written in a somewhat different way.
Let us lower the free index μ on both sides of formula (65.6).
In addition, let us simultaneously lower the dummy index ν in $F^{\mu\nu}$,
and raise it in $u_\nu$. The result is

$$m\frac{du_\mu}{d\tau} = \frac{e}{c}\sum_{\nu=0}^{3} F_{\mu\nu}u^\nu \quad (\mu = 0, 1, 2, 3) \tag{65.7}$$

Equation (65.7) can be obtained directly from Eqs. (65.1) and
(65.4) if we replace the left-hand side in them with the covariant
components of the four-velocity, and write the quantities $E_k$ and $B_k$
as the components of the tensor (61.8).




Chapter XII 


THE VARIATIONAL PRINCIPLE 
IN ELECTRODYNAMICS 


66. Action for a Charged Particle in an Electromagnetic
Field

Agreement with experiments is obtained if we take the following
expression as the action for a particle in a field:

$$S = \int_1^2 \left(-mc\,ds - \frac{e}{c}\sum_{\mu=0}^{3} A_\mu\,dx^\mu\right) \tag{66.1}$$

where m is the mass of the particle, e is its charge, and $A_\mu$ is the
four-potential of the field. We draw attention to the fact that there
is an invariant in the integrand, as there should be. At $A_\mu = 0$,
expression (66.1) transforms into the action for a free particle [see
formula (39.7)].

We must note that the ambiguity of the potential does not affect
the equations of motion. Indeed, substituting for $A_\mu$ in (66.1) the
quantities $A'_\mu$ [see formulas (60.16) and (60.17)], we get additional
terms in the integrand:

$$-\frac{e}{c}\left[\sum_{k=1}^{3}\frac{\partial\psi}{\partial x^k}\,dx^k + C\,dx^0\right] = -\frac{e}{c}\left[d\psi(x^1, x^2, x^3) + C\,dx^0\right]$$

Integration of these terms yields a constant quantity:

$$-\frac{e}{c}\int_1^2 \left(d\psi + C\,dx^0\right) = -\frac{e}{c}\left[\psi(2) + C x^0_2\right] + \frac{e}{c}\left[\psi(1) + C x^0_1\right]$$

which in variation of the action vanishes.

The right-hand side of (66.1) consists of two terms. The first
depends only on the properties of a particle (on m). The second
describes the interaction of the particle with the field; it contains
accordingly both a quantity characterizing the particle (the charge e)
and a quantity characterizing the field (the potential $A_\mu$). In general,
we ought to include another term describing the field itself. But in
considering the motion of a particle in a given field, this term may
be left out of consideration because owing to the determinacy of
the field it should not vary. True, this statement holds only provided
that the charge of the particle is so small that we may disregard
its influence on the field. In considering the motion of a particle
in a given field, we shall assume that this condition is satisfied.

We shall obtain equations of motion of a particle proceeding
from the principle of least action. According to this principle, the
following condition is observed for the true trajectory of a particle:

$$\delta S = \delta\int_1^2 \left(-mc\sqrt{\sum_\mu dx_\mu\,dx^\mu} - \frac{e}{c}\sum_\mu A_\mu\,dx^\mu\right) = 0$$

(we have made the substitution $ds^2 = \sum_\mu dx_\mu\,dx^\mu$). Performing
variation, we obtain

$$\delta S = -\int_1^2 \left(m\sum_\mu u_\mu\,d\,\delta x^\mu + \frac{e}{c}\sum_\mu A_\mu\,d\,\delta x^\mu + \frac{e}{c}\sum_\mu \delta A_\mu\,dx^\mu\right) = 0 \tag{66.2}$$

[see formula (39.9); recall that $\delta\,dx^\mu = d\,\delta x^\mu$].

Let us integrate the first two terms in the integrand by parts.
As a result, the first term becomes

$$\int_1^2 m\sum_\mu \delta x^\mu\,\frac{du_\mu}{d\tau}\,d\tau$$

where τ is the proper time of the particle, and $u_\mu$ is the four-velocity
of the particle [see formula (39.11)]. Integration of the second term
by parts yields

$$-\frac{e}{c}\int_1^2 \sum_\mu A_\mu\,d\,\delta x^\mu = -\frac{e}{c}\left[\sum_\mu A_\mu\,\delta x^\mu\right]_1^2 + \frac{e}{c}\int_1^2 \sum_\mu \delta x^\mu\,dA_\mu$$

The first expression on the right vanishes because at the ends of
the trajectory $\delta x^\mu = 0$. Consequently, after integration by parts
of the first two terms, expression (66.2) becomes

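$$\delta S = \int_1^2 \left(m\sum_\mu \delta x^\mu\,du_\mu + \frac{e}{c}\sum_\mu \delta x^\mu\,dA_\mu - \frac{e}{c}\sum_\nu \delta A_\nu\,dx^\nu\right) = 0$$

(it will come to light in the following text that it is expedient to
denote the dummy index in the third term by the symbol ν).
Now let us perform the substitutions

$$dA_\mu = \sum_\nu \frac{\partial A_\mu}{\partial x^\nu}\,dx^\nu = \sum_\nu \frac{\partial A_\mu}{\partial x^\nu}\,u^\nu\,d\tau$$

in the second term, and

$$\delta A_\nu = \sum_\mu \frac{\partial A_\nu}{\partial x^\mu}\,\delta x^\mu \quad \text{and} \quad dx^\nu = u^\nu\,d\tau$$

in the third term. The result is

$$\delta S = \int_1^2 \left(m\sum_\mu \delta x^\mu\,\frac{du_\mu}{d\tau}\,d\tau + \frac{e}{c}\sum_\mu \delta x^\mu \sum_\nu \frac{\partial A_\mu}{\partial x^\nu}\,u^\nu\,d\tau - \frac{e}{c}\sum_\nu\sum_\mu \frac{\partial A_\nu}{\partial x^\mu}\,\delta x^\mu\,u^\nu\,d\tau\right) = 0$$

which can be written as

$$\int_1^2 \sum_\mu \left\{ m\frac{du_\mu}{d\tau} - \frac{e}{c}\sum_\nu \left(\frac{\partial A_\nu}{\partial x^\mu} - \frac{\partial A_\mu}{\partial x^\nu}\right) u^\nu \right\} \delta x^\mu\,d\tau = 0$$

For the condition indicated above to be satisfied at arbitrary values
of $\delta x^\mu$, it is essential that all the expressions in braces be
zero. Hence, we obtain the relations

$$m\frac{du_\mu}{d\tau} = \frac{e}{c}\sum_\nu \left(\frac{\partial A_\nu}{\partial x^\mu} - \frac{\partial A_\mu}{\partial x^\nu}\right) u^\nu$$

or

$$m\frac{du_\mu}{d\tau} = \frac{e}{c}\sum_\nu F_{\mu\nu}\,u^\nu \quad (\mu = 0, 1, 2, 3) \tag{66.3}$$

[see formula (61.5)], which coincides with (65.7).

We have thus obtained an equation of motion of a particle in
a field proceeding from the principle of least action.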
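
The equation of motion lends itself to direct numerical integration. The sketch below integrates the equivalent contravariant form (65.6) with a classical fourth-order Runge-Kutta step for a uniform magnetic field along z. It assumes units with e = m = c = 1, no electric field, and an arbitrary initial four-velocity; the step size and all names are illustrative choices, not part of the text.

```python
import math


def deriv(u_up, F_con, e_over_mc):
    """Right-hand side of (65.6): du^mu/dtau = (e/mc) sum_nu F^{mu nu} u_nu."""
    u_low = (u_up[0], -u_up[1], -u_up[2], -u_up[3])
    return [e_over_mc * sum(F_con[m][n] * u_low[n] for n in range(4))
            for m in range(4)]


def rk4_step(u, F_con, e_over_mc, h):
    """One classical Runge-Kutta step in the proper time tau."""
    k1 = deriv(u, F_con, e_over_mc)
    k2 = deriv([u[i] + 0.5 * h * k1[i] for i in range(4)], F_con, e_over_mc)
    k3 = deriv([u[i] + 0.5 * h * k2[i] for i in range(4)], F_con, e_over_mc)
    k4 = deriv([u[i] + h * k3[i] for i in range(4)], F_con, e_over_mc)
    return [u[i] + h / 6.0 * (k1[i] + 2.0 * k2[i] + 2.0 * k3[i] + k4[i])
            for i in range(4)]


# Uniform field B_z = 1, E = 0, written in the layout (61.9).
F_con = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, -1.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]

# Initial four-velocity with u^1 = 1; u^0 is fixed by u_mu u^mu = c^2 = 1.
u = [math.sqrt(2.0), 1.0, 0.0, 0.0]
for _ in range(100):  # integrate up to tau = 1 with step 0.01
    u = rk4_step(u, F_con, 1.0, 0.01)

norm = u[0] ** 2 - u[1] ** 2 - u[2] ** 2 - u[3] ** 2
```

In this field the space part of the four-velocity simply rotates, $u^1 = \cos\tau$ and $u^2 = -\sin\tau$, while $u^0$ and the invariant $\sum_\mu u_\mu u^\mu$ stay constant; the antisymmetry of $F^{\mu\nu}$ guarantees the latter for any field.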


67. Action for an Electromagnetic Field 

In the preceding section, we considered the field in which a particle 
moves to be preset, and in this connection we took no account of 
the term in the action describing the properties of the field itself. 
Now let us consider a system made up of particles in an electromag- 
netic field and attempt to find equations determining the field 
proceeding from the principle of least action. Here we can give 
up our assumption on the smallness of the particles’ charges and 
obtain equations for the true field, i.e. the one obtained upon super- 
position of the external field and that produced by the charges 
themselves. Consequently, the values of $A_\mu$ will have to depend
on the positions and velocities of the particles.

The action for the system field + particles must consist of three
terms:

$$S = S_f + S_m + S_{mf} \tag{67.1}$$

Here $S_f$ is the part of the action that depends only on the properties
of the field itself, i.e. the action for the field in the absence of
charges; $S_m$ is the part of the action that depends only on the properties
of the particles, i.e. the action for the free charges. And, finally,
$S_{mf}$ is the part of the action that is due to the interaction between
the particles and the field.

As regards the last two terms, their values can be obtained by
summating expressions (66.1) for all the particles. Hence,

$$S_m = -\sum_{a=1}^{N} m_a c \int ds_a \tag{67.2}$$

$$S_{mf} = -\sum_{a=1}^{N} \frac{e_a}{c} \int \sum_{\mu=0}^{3} A_{\mu a}\,dx_a^\mu \tag{67.3}$$

Here a is the number of the particle, N is the number of particles
considered, and $A_{\mu a}$ is the potential of the field at the point of
four-space where the a-th particle is. The subscript a must be
distinguished from the indices μ, ν, ρ, . . . used to designate the
components of four-vectors and four-tensors. For the latter indices, we
must distinguish between upper and lower positions. This distinction
has no meaning for the index a.

Expression (67.3) can be written in a different way. For this
purpose, let us substitute $(dx_a^\mu/dt)\,dt$ for $dx_a^\mu$ in it, i.e. write

$$S_{mf} = -\frac{1}{c}\sum_a e_a \int \sum_\mu A_{\mu a}\,\frac{dx_a^\mu}{dt}\,dt$$

Next let us write the collection of point charges $e_a$ as a charge
distributed in space with the density

$$\rho = \sum_a e_a\,\delta(\mathbf{r} - \mathbf{r}_a)$$

[see (41.12)]. Therefore, the charge confined in the volume element
dV can be written as $de = \rho\,dV$, and a sum of the form
$\sum_a e_a f(x_a, y_a, z_a)$ can be replaced with the integral

$$\int \rho(x, y, z)\,f(x, y, z)\,dV$$

As a result, we obtain

$$S_{mf} = -\frac{1}{c}\int \rho\,dV \int \sum_\mu A_\mu\,\frac{dx^\mu}{dt}\,dt$$

Now let us take into account that $\rho\,(dx^\mu/dt) = j^\mu$, where $j^\mu$ is
a component of the four-current [see formula (60.8)]. Hence, we
can write

$$S_{mf} = -\frac{1}{c}\int \sum_\mu A_\mu j^\mu\,dV\,dt = -\frac{1}{c^2}\int \sum_\mu A_\mu j^\mu\,dV^* \tag{67.4}$$

where $dV^* = dx^0\,dx^1\,dx^2\,dx^3 = c\,dV\,dt$.

To obtain an expression for $S_f$, let us take into account that, as
shown experimentally, an electromagnetic field obeys the superposition
principle. Therefore, the equations for a field must be
linear differential equations. The equations of a field are obtained
by variation of the action. But in variation, the power of the
integrand lowers by one. Consequently, the equations will be linear if
the action contains in the integrand an expression quadratic with
respect to the field. In addition, this expression must be invariant.
There are two very simple quadratic invariants that can be formed
from the characteristics of a field: $\sum_\mu A_\mu A^\mu$ and
$\sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu}$. The first of them is not suitable for
our purposes because the four-potential is determined ambiguously. We
thus arrive at the conclusion that the integrand in the action must
consist of the invariant $\sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu}$. To obtain
the action for an entire field, we must integrate over the entire
four-space where the field is non-zero. Hence, we obtain the expression

$$S_f = a\int \sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu}\,dV^*$$

where a is a constant, and $dV^* = c\,dV\,dt$. Integration over the
coordinates is performed over the entire three-dimensional space,
and over the time between two preset instants $t_1$ and $t_2$.

We obtain correct equations of a field from the expression we have
found if we assume (in the Gaussian system of units) that
$a = -1/16\pi c$. Hence,

$$S_f = -\frac{1}{16\pi c}\int \sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu}\,dV^* \tag{67.5}$$

Summation of expressions (67.2), (67.4), and (67.5) yields the
action for the system field + particles:

$$S = -\sum_a m_a c \int ds_a - \frac{1}{c^2}\int \sum_\mu A_\mu j^\mu\,dV^* - \frac{1}{16\pi c}\int \sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu}\,dV^* \tag{67.6}$$

By formula (63.3), we have $\sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu} = 2\,(B^2 - E^2)$.
Introducing this value into (67.5), and also substituting $c\,dV\,dt$ for
$dV^*$, we can transform the expression for $S_f$ as follows:

$$S_f = \int_{t_1}^{t_2} dt \int \frac{1}{8\pi}\left(E^2 - B^2\right) dV \tag{67.7}$$

A glance at this formula shows that the Lagrangian for the field
is determined by the expression

$$L_f = \frac{1}{8\pi}\int \left(E^2 - B^2\right) dV \tag{67.8}$$

We have thus established the form of the action for an electromagnetic
field. We shall find the equations of a field proceeding
from the principle of least action in the following section.

68. Derivation of Maxwell's Equations from 
the Principle of Least Action 

Let us find the equations of a field, considering the motion of the 
particles to he preset. In this case, the term S m in the action may 
be left out of consideration because in view of the determinacy 
in the motion of the charges it should not vary. We shall thus proceed 
from the expression 

S = Sf + S mf =— j 'ZA^dV* (68.1) 

n.v a 

[see (67.5) and (67.4)1. 

Let us calculate the variation of expression (68.1) and equate 
it to zero. We shall take into account that owing to the determinacy 
of the motion of the charges the current /a should not vary. Hence, 

6S “ - iL 5 6 ( 2 2 dV* (68.2) 

We determine the variation in the first integrand: 

6 2 = 2 F vlv SF' xv + 2 F^SF^ 

We shall raise the indices of the first factor in the first of the sums 
and simultaneously lower them in the second factor. The result is 

6 2 F ilv F v ’ v = 22 F^SF^y, 

With a view to formula (61.5) for F tlV , we shall write this expression 
as follows: 

6SV‘ , =2 2r""6(^-^-) 

Taking advantage of the antisymmetric nature of the tensor 
we shall substitute — F v n for j n the first sum on the right-hand 
side, and then exchange the places of the indices p and v. As a result, 
the first sum will become identical to the second one, and we obtain 

i 8 2v ,lv =-4E r8i T 


(68.3) 
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By changing the sequence of differentiation and variation of the 
quantities we reduce (68.3) to the form 

OX 

Now let us transform the expression obtained by the formula uv' — 
— {uv)' — u'v: 

6 S F ^ = - 4 2 ~ (^ v ^u) +4 E ~ r 

Substituting this expression into (68.2), we arrive at the following 
formula for the variation of the action: 

J 2 1 2 

H,v H,V 

2>^dV* (68.4) 

H 

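The first of the three integrals can be transformed according
to the Ostrogradsky-Gauss formula into a surface integral:

$$\int\sum_{\mu,\nu}\frac{\partial}{\partial x^\nu}\big(F^{\mu\nu}\,\delta A_\mu\big)dV^* = \oint\sum_{\mu,\nu} F^{\mu\nu}\,\delta A_\mu\,df_\nu$$

At the boundary of the four-volume being considered, $\delta A_\mu = 0$.
Hence, the integral we have written vanishes. Consequently, only
the second and third terms must be retained in formula (68.4).
Combining them and factoring out the common factor $\delta A_\mu$, we
obtain

$$\delta S = -\frac{1}{c}\int\sum_{\mu}\Big(\frac{1}{4\pi}\sum_{\nu}\frac{\partial F^{\mu\nu}}{\partial x^\nu} + \frac{1}{c}\,j^\mu\Big)\delta A_\mu\,dV^*$$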

Owing to the arbitrary nature of the variations $\delta A_\mu$, the value
of $\delta S$ we have found may be zero only if all the expressions in
parentheses vanish. We thus arrive at the equations

$$\sum_{\nu}\frac{\partial F^{\mu\nu}}{\partial x^\nu} = -\frac{4\pi}{c}\,j^\mu \quad (\mu = 0, 1, 2, 3) \tag{68.5}$$

that are the second pair of Maxwell's equations [see Eq. (64.2)].

We must note that the relation between the fields B and E, on
the one hand, and the charges and currents, on the other, is
determined exactly by the second pair of Maxwell's equations [see
Eqs. (55.13)]. The first pair of Maxwell's equations expresses the
properties of the fields B and E and their relation to each other
[see Eqs. (55.12)].

69. Energy-Momentum Tensor of an Electromagnetic Field ¹

We established in Sec. 67 that the action for an electromagnetic
field is determined by the expression

$$S_f = -\frac{1}{16\pi c}\int\sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu}\,dV^* \tag{69.1}$$

[see formula (67.5)].

A comparison of formulas (69.1) and (40.2) shows that we must
take the following expression as the density of the Lagrangian for
a field:

$$L^* = -\frac{1}{16\pi}\sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu} \tag{69.2}$$

We must take the components of the potential $A_\mu$ as the generalized
coordinates $q_a$ for the field, and the derivatives of these components
with respect to the coordinates $x^\nu$ as the generalized velocities.
To simplify the writing of the formulas, let us introduce the
designation

$$a_{\mu\nu} = \frac{\partial A_\mu}{\partial x^\nu} \tag{69.3}$$

Equating the variation of the action to zero and performing the
same calculations that led us to the equations of motion (40.12)
in Sec. 40, we shall arrive at the relations

$$\frac{\partial L^*}{\partial A_\mu} = \sum_{\nu}\frac{\partial}{\partial x^\nu}\,\frac{\partial L^*}{\partial a_{\mu\nu}} \quad (\mu = 0, 1, 2, 3) \tag{69.4}$$


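Equations (69.4) are equations of a field. To determine the values
of the derivatives in them, let us write the variation of the
function $L^*$. According to the general rules for calculating a variation,
we have

$$\delta L^* = \sum_{\mu}\frac{\partial L^*}{\partial A_\mu}\,\delta A_\mu + \sum_{\mu,\nu}\frac{\partial L^*}{\partial a_{\mu\nu}}\,\delta a_{\mu\nu} \tag{69.5}$$

On the other hand, the variation of the function (69.2) is

$$\delta L^* = -\frac{1}{16\pi}\,\delta\sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu} = \frac{1}{4\pi}\sum_{\mu,\nu} F^{\mu\nu}\,\delta\Big(\frac{\partial A_\mu}{\partial x^\nu}\Big) = \frac{1}{4\pi}\sum_{\mu,\nu} F^{\mu\nu}\,\delta a_{\mu\nu}$$

[see formula (68.3)].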

A comparison of the expression obtained with formula (69.5)
allows us to conclude that

$$\frac{\partial L^*}{\partial A_\mu} = 0, \qquad \frac{\partial L^*}{\partial a_{\mu\nu}} = \frac{1}{4\pi}\,F^{\mu\nu} \tag{69.6}$$

Substituting $-F^{\nu\mu}$ for $F^{\mu\nu}$ in the second expression and then
exchanging the places of the indices $\mu$ and $\nu$, we find that

$$\frac{\partial L^*}{\partial a_{\nu\mu}} = -\frac{1}{4\pi}\,F^{\mu\nu} \tag{69.7}$$

¹ Before beginning to read this section, recall the contents of Sec. 40.
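The second of relations (69.6) can be checked numerically. The following is a minimal sketch, assuming the metric diag(1, -1, -1, -1) and treating the sixteen quantities $a_{\mu\nu}$ as independent numbers; the function names and the sample values are illustrative and do not come from the text.

```python
import math

# Diagonal of the metric tensor, assumed to be diag(1, -1, -1, -1).
G = [1.0, -1.0, -1.0, -1.0]


def field_tensor_lower(a):
    """F_{mu nu} = dA_nu/dx^mu - dA_mu/dx^nu with a[mu][nu] = dA_mu/dx^nu, see (69.3)."""
    return [[a[n][m] - a[m][n] for n in range(4)] for m in range(4)]


def raise_both(F):
    """F^{mu nu} obtained from F_{mu nu}; for a diagonal metric only signs change."""
    return [[G[m] * G[n] * F[m][n] for n in range(4)] for m in range(4)]


def lagrangian_density(a):
    """L* = -(1/16 pi) sum F_{mu nu} F^{mu nu}, formula (69.2)."""
    F = field_tensor_lower(a)
    F_up = raise_both(F)
    total = sum(F[m][n] * F_up[m][n] for m in range(4) for n in range(4))
    return -total / (16 * math.pi)


def numerical_derivative(a, mu, nu, h=1e-5):
    """Central-difference estimate of dL*/da_{mu nu}."""
    plus = [row[:] for row in a]
    minus = [row[:] for row in a]
    plus[mu][nu] += h
    minus[mu][nu] -= h
    return (lagrangian_density(plus) - lagrangian_density(minus)) / (2 * h)


if __name__ == "__main__":
    # Arbitrary sample values of a_{mu nu} (hypothetical, for illustration only).
    a = [[0.1 * (m + 1) - 0.07 * n * n + 0.03 * m * n for n in range(4)] for m in range(4)]
    F_up = raise_both(field_tensor_lower(a))
    for mu in range(4):
        for nu in range(4):
            print(mu, nu, numerical_derivative(a, mu, nu), F_up[mu][nu] / (4 * math.pi))
```

Because $L^*$ is quadratic in $a_{\mu\nu}$, the central difference reproduces the derivative up to rounding error, so the two printed columns should agree closely.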


Introduction of the values (69.6) into formula (69.4) leads to the
following "equations of motion" for an electromagnetic field:

$$\sum_{\nu}\frac{\partial F^{\mu\nu}}{\partial x^\nu} = 0 \quad (\mu = 0, 1, 2, 3) \tag{69.8}$$

(we have omitted the factor $1/4\pi$). Equation (69.8) is Maxwell's
equation (64.2) written for the case when $j^\mu = 0$. This result is
what should have been expected because we proceeded from the
action for only the field without any charges.

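Now let us establish the form of the energy-momentum tensor
for an electromagnetic field. Substituting the quantities $a_{\rho\mu}$ [see
(40.3) and (69.3)] for $q_{a\mu}$ in formula (40.16), we obtain the following
expression for the components of this tensor:

$$T^\nu_\mu = \sum_{\rho} a_{\rho\mu}\,\frac{\partial L^*}{\partial a_{\rho\nu}} - \delta^\nu_\mu L^*$$

Let us replace $\partial L^*/\partial a_{\rho\nu}$ with $-(1/4\pi)F^{\nu\rho}$ in accordance with (69.7).
In addition, let us substitute for $a_{\rho\mu}$ its value from (69.3), and
for $L^*$ expression (69.2). The result is

$$T^\nu_\mu = -\frac{1}{4\pi}\sum_{\rho}\frac{\partial A_\rho}{\partial x^\mu}\,F^{\nu\rho} + \frac{1}{16\pi}\,\delta^\nu_\mu\sum_{\rho,\sigma} F_{\rho\sigma}F^{\rho\sigma} \tag{69.9}$$

[in the second sum, we may not designate the dummy indices by
the letters $\mu$ and $\nu$ as was done in formula (69.2) because in formula
(69.9) $\mu$ and $\nu$ were already chosen as free indices].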

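The tensor (69.9) is not symmetric. To make it symmetric, let
us add to it the tensor

$$\frac{1}{4\pi}\sum_{\rho}\frac{\partial A_\mu}{\partial x^\rho}\,F^{\nu\rho} \tag{69.10}$$

which, as we shall show, can be written as

$$\sum_{\rho}\frac{\partial}{\partial x^\rho}\Big(\frac{1}{4\pi}\,A_\mu F^{\nu\rho}\Big)$$

[see formula (40.18)]. Indeed, let us apply to expression (69.10)
the transformation $u'v = (uv)' - uv'$:

$$\frac{1}{4\pi}\sum_{\rho}\frac{\partial A_\mu}{\partial x^\rho}\,F^{\nu\rho} = \frac{1}{4\pi}\sum_{\rho}\frac{\partial}{\partial x^\rho}\big(A_\mu F^{\nu\rho}\big) - \frac{1}{4\pi}\sum_{\rho} A_\mu\,\frac{\partial F^{\nu\rho}}{\partial x^\rho}$$

Owing to (69.8) the second sum on the right-hand side vanishes
($A_\mu$ can be factored out of the sum sign). Consequently, we have
brought the tensor (69.10) to the form

$$\sum_{\rho}\frac{\partial}{\partial x^\rho}\Big(\frac{1}{4\pi}\,A_\mu F^{\nu\rho}\Big)$$

Since the expression in parentheses is antisymmetric with respect
to the indices $\nu$ and $\rho$, expression (69.10) may be added to the
tensor (69.9) [see formula (40.18) and the text associated with it].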
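Summation of expressions (69.9) and (69.10) gives the tensor

$$T^\nu_\mu = -\frac{1}{4\pi}\sum_{\rho}\frac{\partial A_\rho}{\partial x^\mu}\,F^{\nu\rho} + \frac{1}{4\pi}\sum_{\rho}\frac{\partial A_\mu}{\partial x^\rho}\,F^{\nu\rho} + \frac{1}{16\pi}\,\delta^\nu_\mu\sum_{\rho,\sigma} F_{\rho\sigma}F^{\rho\sigma}$$

$$= -\frac{1}{4\pi}\sum_{\rho}\Big(\frac{\partial A_\rho}{\partial x^\mu} - \frac{\partial A_\mu}{\partial x^\rho}\Big)F^{\nu\rho} + \frac{1}{16\pi}\,\delta^\nu_\mu\sum_{\rho,\sigma} F_{\rho\sigma}F^{\rho\sigma}$$

The expression in parentheses is $F_{\mu\rho}$ [see (61.5)]. Consequently,
the formula for the mixed components of the energy-momentum
tensor becomes

$$T^\nu_\mu = -\frac{1}{4\pi}\sum_{\rho} F_{\mu\rho}F^{\nu\rho} + \frac{1}{16\pi}\,\delta^\nu_\mu\sum_{\rho,\sigma} F_{\rho\sigma}F^{\rho\sigma} \tag{69.11}$$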

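To go over to contravariant components, we shall raise the index $\mu$
in all the terms of formula (69.11). Here $\delta^\nu_\mu$ will transform into
$g^{\mu\nu}$ [see (XII.66) and (XII.67)]. Hence,

$$T^{\mu\nu} = -\frac{1}{4\pi}\sum_{\rho} F^{\mu}{}_{\rho}\,F^{\nu\rho} + \frac{1}{16\pi}\,g^{\mu\nu}\sum_{\rho,\sigma} F_{\rho\sigma}F^{\rho\sigma} \tag{69.12}$$

Raising of an index in one factor with simultaneous lowering of
the same index in the second factor does not change the product.
We can therefore write with equal grounds that

$$T^{\mu\nu} = -\frac{1}{4\pi}\sum_{\rho} F^{\mu\rho}\,F^{\nu}{}_{\rho} + \frac{1}{16\pi}\,g^{\mu\nu}\sum_{\rho,\sigma} F_{\rho\sigma}F^{\rho\sigma} \tag{69.13}$$

A comparison of expressions (69.12) and (69.13) allows us to
conclude that the tensor $T^{\mu\nu}$ is indeed symmetric: exchanging $\mu$ and $\nu$
in one of them gives the other.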

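Let us calculate the trace of the tensor $T^{\mu\nu}$. According to
formula (69.11),

$$\sum_{\mu} T^\mu_\mu = \sum_{\mu}\Big(-\frac{1}{4\pi}\sum_{\rho} F_{\mu\rho}F^{\mu\rho}\Big) + \sum_{\mu}\Big(\frac{1}{16\pi}\,\delta^\mu_\mu\sum_{\rho,\sigma} F_{\rho\sigma}F^{\rho\sigma}\Big)$$

In the second term, all the factors except $\delta^\mu_\mu$ can be put outside the
sign of summation over $\mu$. The sum $\sum_\mu\delta^\mu_\mu$ equals four. Hence

$$\sum_{\mu} T^\mu_\mu = -\frac{1}{4\pi}\sum_{\mu,\rho} F_{\mu\rho}F^{\mu\rho} + \frac{1}{4\pi}\sum_{\rho,\sigma} F_{\rho\sigma}F^{\rho\sigma}$$

It is not difficult to see that the last expression vanishes. We have
therefore established that the trace of the tensor $T^{\mu\nu}$ is zero:

$$\sum_{\mu} T^\mu_\mu = 0 \tag{69.14}$$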

Let us find the expressions for the components of the tensor $T^{\mu\nu}$
in terms of the components of the vectors E and B. For this purpose,
we shall introduce the values of the components $F^{\mu\nu}$ into (69.13).
We shall take these values from (61.9), having in mind that lowering
of the time index does not change the components of the tensor,
while lowering of the space index reverses the sign of a component.
We shall also take into account that the sum $\sum F_{\mu\nu}F^{\mu\nu}$ is an
invariant that we calculated in Sec. 63. It equals

$$\sum_{\mu,\nu} F_{\mu\nu}F^{\mu\nu} = 2\,(B^2 - E^2) \tag{69.15}$$

[see formula (63.3)].

Let us begin with calculation of $T^{00}$. According to (69.13), (61.9),
and (69.15), we have

$$T^{00} = -\frac{1}{4\pi}\sum_{\rho} F^{0\rho}F^{0}{}_{\rho} + \frac{1}{16\pi}\,g^{00}\cdot 2\,(B^2 - E^2) = -\frac{1}{4\pi}(-E^2) + \frac{1}{8\pi}(B^2 - E^2) = \frac{1}{8\pi}(E^2 + B^2) = w$$

[$g^{00} = 1$; see (XII.67)]. We have obtained an already known result:
the component $T^{00}$ equals the energy density $w$ [see formula (40.27)].
Now let us find the component $T^{01}$. Since $g^{01} = 0$,

$$T^{01} = -\frac{1}{4\pi}\sum_{\rho} F^{0\rho}F^{1}{}_{\rho} = -\frac{1}{4\pi}\big[0\cdot E_x + (-E_x)\cdot 0 + (-E_y)B_z + (-E_z)(-B_y)\big]$$

$$= \frac{1}{4\pi}(E_yB_z - E_zB_y) = \Big\{\frac{1}{4\pi}[\mathbf{E}\mathbf{B}]\Big\}_x = \frac{1}{c}\,S_x \tag{69.16}$$

where $S_x$ is the $x$-component of the Poynting vector (in a vacuum
H = B). Similar calculations show that $T^{02} = S_y/c$, and
$T^{03} = S_z/c$ [compare with formula (40.32)].

Finally, let us calculate, for example, $T^{12}$. According to (XII.67),
$g^{12} = 0$. Therefore

$$T^{12} = -\frac{1}{4\pi}\sum_{\rho} F^{1\rho}F^{2}{}_{\rho} = -\frac{1}{4\pi}\big[E_xE_y + 0\cdot(-B_z) + (-B_z)\cdot 0 + B_yB_x\big] = -\frac{1}{4\pi}(E_xE_y + B_xB_y) = -\sigma_{xy}$$

where $\sigma_{xy}$ is a component of the Maxwell stress tensor [see
formula (59.12)]. It is a simple matter to verify that all the remaining
components having the form $T^{ik}$ are related in the same way to the
relevant components of the tensor (59.12):

$$T^{ik} = -\sigma_{ik} = \frac{1}{4\pi}\Big\{\frac{1}{2}(E^2 + B^2)\,\delta_{ik} - E_iE_k - B_iB_k\Big\} \tag{69.17}$$

[in verifying, it must be taken into account that $g^{ii} = -1 = -\delta_{ii}$;
see (XII.67)].
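The component formulas above lend themselves to a direct numerical check. The sketch below builds $T^{\mu\nu}$ from (69.13) for arbitrary field values; the layout of $F^{\mu\nu}$ in terms of E and B is an assumption (it is chosen so that $F^{0i} = -E_i$, in agreement with the sums written out above), and the function names are illustrative.

```python
import math

# Diagonal of the metric tensor g^{mu nu} = g_{mu nu} = diag(1, -1, -1, -1).
G = [1.0, -1.0, -1.0, -1.0]


def field_tensor_upper(E, B):
    """Contravariant F^{mu nu} built from E and B (assumed layout, Gaussian units)."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    return [
        [0.0, -Ex, -Ey, -Ez],
        [Ex, 0.0, -Bz, By],
        [Ey, Bz, 0.0, -Bx],
        [Ez, -By, Bx, 0.0],
    ]


def field_invariant(F_up):
    """Sum of F_{mu nu} F^{mu nu}; lowering each index multiplies by the metric."""
    return sum(G[m] * G[n] * F_up[m][n] ** 2 for m in range(4) for n in range(4))


def energy_momentum_tensor(E, B):
    """T^{mu nu} according to (69.13)."""
    F_up = field_tensor_upper(E, B)
    inv = field_invariant(F_up)
    T = [[0.0] * 4 for _ in range(4)]
    for m in range(4):
        for n in range(4):
            # F^{nu}_{rho} = F^{nu rho} g_{rho rho} for a diagonal metric
            contraction = sum(F_up[m][r] * F_up[n][r] * G[r] for r in range(4))
            g_mn = G[m] if m == n else 0.0
            T[m][n] = -contraction / (4 * math.pi) + g_mn * inv / (16 * math.pi)
    return T


def trace(T):
    """Trace of the mixed tensor: sum over mu of T^{mu mu} g_{mu mu}."""
    return sum(T[m][m] * G[m] for m in range(4))


if __name__ == "__main__":
    T = energy_momentum_tensor((1.0, 2.0, -0.5), (0.3, -1.2, 0.7))
    for row in T:
        print(["%+.4f" % x for x in row])
    print("trace:", trace(T))
```

For any choice of E and B the printed matrix should be symmetric, its first entry should equal $(E^2 + B^2)/8\pi$, and the trace should vanish to rounding accuracy, in line with (69.14).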

Let us consider the matter of diagonalization of the tensor $T^{\mu\nu}$.
In Euclidean space, such a transformation of a symmetric tensor
is always possible. In pseudo-Euclidean space, however, matters are
different, as we shall now see. According to (69.16), components
of the form $T^{0i} = T^{i0}$ vanish provided that

$$[\mathbf{E}\mathbf{B}] = 0 \tag{69.18}$$

According to (69.17), components of the form $T^{ik}$ ($i \neq k$) vanish
provided that

$$E_iE_k = 0 \quad\text{and}\quad B_iB_k = 0 \quad (i \neq k) \tag{69.19}$$

Therefore, for diagonalization of the tensor $T^{\mu\nu}$, we must transfer
to a reference frame in which the vectors B and E are collinear,
or one of them vanishes [in which case condition (69.18) is satisfied].
We established in Sec. 62 that such a frame always exists except
when B and E are mutually perpendicular and identical in magnitude.
In this reference frame, one of the coordinate axes must be directed
along the field. This will result in condition (69.19) being satisfied.
The tensor will therefore acquire a diagonal form. Let us find the
components $T^{ii}$ assuming that the $x$-axis has been chosen in the
direction of the fields and, consequently,

$$E_x = E,\quad E_y = E_z = 0,\quad B_x = \pm B,\quad B_y = B_z = 0$$

Hence, by (69.17),

$$T^{11} = \frac{1}{4\pi}\Big\{\frac{1}{2}(E^2 + B^2) - E^2 - B^2\Big\} = -\frac{1}{8\pi}(E^2 + B^2) = -w$$

$$T^{22} = T^{33} = \frac{1}{8\pi}(E^2 + B^2) = w$$

We know that the component $T^{00}$ also equals $w$. Being reduced
to the diagonal form, the energy-momentum tensor of an
electromagnetic field is as follows:

$$(T^{\mu\nu}) = \begin{pmatrix} w & 0 & 0 & 0 \\ 0 & -w & 0 & 0 \\ 0 & 0 & w & 0 \\ 0 & 0 & 0 & w \end{pmatrix} \tag{69.20}$$

If the vectors B and E are mutually perpendicular and identical
in magnitude, the tensor $T^{\mu\nu}$ cannot be given a diagonal form (the
mutual perpendicularity does not allow B and E to be transformed
so that they become collinear, and the equality of their magnitudes
does not allow them to be transformed so that one of the fields
vanishes).

We must note that in mixed components, the tensor (69.20) is

$$(T^\nu_\mu) = \begin{pmatrix} w & 0 & 0 & 0 \\ 0 & w & 0 & 0 \\ 0 & 0 & -w & 0 \\ 0 & 0 & 0 & -w \end{pmatrix} \tag{69.21}$$

It directly follows from (69.21) that the trace of the tensor $T^{\mu\nu}$
is zero. We obtained this result earlier in the general case [see
(69.14)].


70. A Charged Particle in an Electromagnetic Field 


We know that the action for a charged particle in an electromagnetic
field is determined by the expression

$$S = \int_1^2\Big(-mc\,ds - \frac{e}{c}\sum_{\mu=0}^{3} A_\mu\,dx^\mu\Big) \tag{70.1}$$

[see formula (66.1)].

The covariant components of the four-potential can be written as

$$A_\mu = (\varphi,\ -\mathbf{A})$$

[see (60.15)]. The components of the four-position vector are

$$x^\mu = (ct,\ \mathbf{r})$$

Consequently,

$$\sum_{\mu=0}^{3} A_\mu\,dx^\mu = \varphi c\,dt - \mathbf{A}\,d\mathbf{r} = (c\varphi - \mathbf{A}\mathbf{v})\,dt$$

Let us substitute this value of the sum into formula (70.1). In
addition, let us substitute $c\sqrt{1 - v^2/c^2}\,dt$ for $ds$ in accordance
with (34.4). The result is

$$S = \int_1^2\Big(-mc^2\sqrt{1 - v^2/c^2} + \frac{e}{c}\,\mathbf{A}\mathbf{v} - e\varphi\Big)dt$$

Hence we conclude that the Lagrangian for a charged particle in
a field is

$$L = -mc^2\sqrt{1 - v^2/c^2} + \frac{e}{c}\,\mathbf{A}\mathbf{v} - e\varphi \tag{70.2}$$

The first term is the Lagrangian for a free particle [see (39.4)]. 
The other two terms describe the interaction of the particle with 
the field. 

Knowing the Lagrangian, we can calculate the energy and momentum
of a particle. According to formulas (4.19) and (5.1), the
generalized momentum is determined by the expression

$$\mathbf{P} = \frac{\partial L}{\partial\mathbf{v}}$$

and the energy, by the expression

$$W = \mathbf{v}\,\frac{\partial L}{\partial\mathbf{v}} - L \tag{70.3}$$

Consequently, having differentiated the function (70.2) with respect
to v, we obtain the generalized momentum of a particle:

$$\mathbf{P} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} + \frac{e}{c}\,\mathbf{A} = \mathbf{p} + \frac{e}{c}\,\mathbf{A} \tag{70.4}$$

Here p is the conventional momentum of the particle [see
formula (38.5)]. A glance at formula (70.4) shows that the generalized
momentum differs from the conventional one in the term $(e/c)\mathbf{A}$.
In the absence of a field, the generalized momentum coincides with
the conventional one.

Now let us determine the energy of a particle. In accordance
with (70.3) and (70.2), we have

$$W = \frac{\partial L}{\partial\mathbf{v}}\,\mathbf{v} - L = \mathbf{P}\mathbf{v} - L = \Big(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}} + \frac{e}{c}\,\mathbf{A}\Big)\mathbf{v} - \Big(-mc^2\sqrt{1 - v^2/c^2} + \frac{e}{c}\,\mathbf{A}\mathbf{v} - e\varphi\Big)$$

$$= \frac{mc^2}{\sqrt{1 - v^2/c^2}} + e\varphi \tag{70.5}$$


The first term in (70.5) is the energy of a free particle [see
formula (38.11)], the second term is the additional energy that a particle
in the field has.

Expressing the velocity v in the formula for the energy in terms of
the generalized momentum P, let us find the Hamiltonian of a
particle. To eliminate v from formulas (70.5) and (70.4), let us write
these equations as

$$\frac{W - e\varphi}{c} = \frac{mc}{\sqrt{1 - v^2/c^2}}$$

$$\mathbf{P} - \frac{e}{c}\,\mathbf{A} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}$$

If we square these equations and subtract the lower one from the
upper one, on the right we obtain $m^2c^2$. Consequently, substituting
$\mathcal{H}$ for $W$, we have

$$\Big(\frac{\mathcal{H} - e\varphi}{c}\Big)^2 - \Big(\mathbf{P} - \frac{e}{c}\,\mathbf{A}\Big)^2 = m^2c^2$$

whence

$$\mathcal{H} = \sqrt{m^2c^4 + c^2\Big(\mathbf{P} - \frac{e}{c}\,\mathbf{A}\Big)^2} + e\varphi \tag{70.6}$$

This is the Hamiltonian for a particle in a field.

It was established in Sec. 32 that the components of the generalized
momentum equal the derivatives of the action with respect
to the relevant generalized coordinates [see formula (32.6)]. In our
case, the role of the generalized coordinates is played by the
Cartesian coordinates $x_i$. Consequently,

$$P_i = \frac{\partial S}{\partial x_i} \quad\text{or}\quad \mathbf{P} = \nabla S$$

Further, according to (32.10), the derivative of the action with
respect to time gives the Hamiltonian taken with the opposite sign:

$$\frac{\partial S}{\partial t} = -\mathcal{H} \tag{70.7}$$

Substituting $-\partial S/\partial t$ for $\mathcal{H}$ and $\nabla S$ for P in formula (70.6), we
arrive at the Hamilton-Jacobi equation for a particle in an
electromagnetic field:

$$\frac{1}{c^2}\Big(\frac{\partial S}{\partial t} + e\varphi\Big)^2 - \Big(\nabla S - \frac{e}{c}\,\mathbf{A}\Big)^2 - m^2c^2 = 0 \tag{70.8}$$

In the classical approximation, i.e. when $v \ll c$, the function
(70.2) becomes

$$L = \frac{mv^2}{2} + \frac{e}{c}\,\mathbf{A}\mathbf{v} - e\varphi \tag{70.9}$$

[we have expanded (70.2) in powers of $v^2/c^2$ and discarded the
constant $-mc^2$].

Differentiation of (70.9) with respect to v yields the generalized
momentum

$$\mathbf{P} = m\mathbf{v} + \frac{e}{c}\,\mathbf{A} = \mathbf{p} + \frac{e}{c}\,\mathbf{A} \tag{70.10}$$

where p is the conventional momentum.

For the energy in this approximation, we obtain

$$W = \mathbf{P}\mathbf{v} - L = \Big(m\mathbf{v} + \frac{e}{c}\,\mathbf{A}\Big)\mathbf{v} - \frac{mv^2}{2} - \frac{e}{c}\,\mathbf{A}\mathbf{v} + e\varphi = \frac{mv^2}{2} + e\varphi \tag{70.11}$$

From (70.10), we have

$$\mathbf{v} = \frac{1}{m}\Big(\mathbf{P} - \frac{e}{c}\,\mathbf{A}\Big)$$

Introducing this value of v into (70.11), we arrive at an expression
for the Hamiltonian:

$$\mathcal{H} = \frac{1}{2m}\Big(\mathbf{P} - \frac{e}{c}\,\mathbf{A}\Big)^2 + e\varphi \tag{70.12}$$

The Hamilton-Jacobi equation in the classical approximation is

$$\frac{\partial S}{\partial t} + \frac{1}{2m}\Big(\nabla S - \frac{e}{c}\,\mathbf{A}\Big)^2 + e\varphi = 0 \tag{70.13}$$
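The elimination of v that leads from (70.4) and (70.5) to the Hamiltonian (70.6), and the passage to the classical expression (70.12), can be illustrated numerically. This is a minimal sketch in arbitrary units; the function names and sample values are illustrative assumptions, not part of the text.

```python
import math


def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def generalized_momentum(m, e, c, v, A):
    """P = m v / sqrt(1 - v^2/c^2) + (e/c) A, formula (70.4)."""
    gamma = 1.0 / math.sqrt(1.0 - dot(v, v) / c ** 2)
    return [gamma * m * vi + e * Ai / c for vi, Ai in zip(v, A)]


def energy(m, e, c, v, phi):
    """W = m c^2 / sqrt(1 - v^2/c^2) + e phi, formula (70.5)."""
    return m * c ** 2 / math.sqrt(1.0 - dot(v, v) / c ** 2) + e * phi


def hamiltonian(m, e, c, P, A, phi):
    """H = sqrt(m^2 c^4 + c^2 (P - eA/c)^2) + e phi, formula (70.6)."""
    kin = [Pi - e * Ai / c for Pi, Ai in zip(P, A)]
    return math.sqrt(m ** 2 * c ** 4 + c ** 2 * dot(kin, kin)) + e * phi


def hamiltonian_classical(m, e, c, P, A, phi):
    """H = (P - eA/c)^2 / (2m) + e phi, formula (70.12)."""
    kin = [Pi - e * Ai / c for Pi, Ai in zip(P, A)]
    return dot(kin, kin) / (2.0 * m) + e * phi


if __name__ == "__main__":
    m, e, c = 1.0, 0.5, 10.0
    A, phi = (0.2, 0.1, -0.3), 0.7
    for v in [(3.0, -4.0, 1.0), (0.01, 0.02, 0.0)]:
        P = generalized_momentum(m, e, c, v, A)
        print("v =", v)
        print("  W from (70.5):        ", energy(m, e, c, v, phi))
        print("  H from (70.6):        ", hamiltonian(m, e, c, P, A, phi))
        print("  H - mc^2:             ", hamiltonian(m, e, c, P, A, phi) - m * c ** 2)
        print("  classical H (70.12):  ", hamiltonian_classical(m, e, c, P, A, phi))
```

The first two printed values should coincide for any velocity below c, while the last two approach each other only when v is small compared with c.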




Chapter XIII 


ELECTROMAGNETIC WAVES 


71. The Wave Equation 


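It was shown in Sec. 57 that upon application to the potentials A
and $\varphi$ of the Lorentz condition

$$\nabla\mathbf{A} + \frac{\varepsilon\mu}{c}\,\frac{\partial\varphi}{\partial t} = 0 \tag{71.1}$$

[see formula (56.8)] they satisfy d'Alembert's equation:

$$\Box\mathbf{A} = -\frac{4\pi\mu}{c}\,\mathbf{j} \tag{71.2}$$

$$\Box\varphi = -\frac{4\pi}{\varepsilon}\,\rho \tag{71.3}$$

[see Eqs. ¹ (57.6) and (57.7)]. Here $\Box$ is d'Alembert's operator equal
to

$$\Box = \Delta - \frac{\varepsilon\mu}{c^2}\,\frac{\partial^2}{\partial t^2} = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} - \frac{\varepsilon\mu}{c^2}\,\frac{\partial^2}{\partial t^2} \tag{71.4}$$

[see (57.4)].

In the absence of charges and currents (i.e. at $\rho = 0$ and $\mathbf{j} = 0$),
the equations for the potentials acquire the form

$$\Box\mathbf{A} = 0 \tag{71.5}$$

$$\Box\varphi = 0 \tag{71.6}$$

or, with account of (71.4),

$$\Delta\mathbf{A} - \frac{\varepsilon\mu}{c^2}\,\frac{\partial^2\mathbf{A}}{\partial t^2} = 0 \tag{71.7}$$

$$\Delta\varphi - \frac{\varepsilon\mu}{c^2}\,\frac{\partial^2\varphi}{\partial t^2} = 0 \tag{71.8}$$

Similar equations are also obtained for the vectors E and B:

$$\Delta\mathbf{E} - \frac{\varepsilon\mu}{c^2}\,\frac{\partial^2\mathbf{E}}{\partial t^2} = 0 \tag{71.9}$$

$$\Delta\mathbf{B} - \frac{\varepsilon\mu}{c^2}\,\frac{\partial^2\mathbf{B}}{\partial t^2} = 0 \tag{71.10}$$

(see Sec. 74).

¹ Recall that Eqs. (57.6) and (57.7) were obtained on the assumption that
the medium in which the field is being considered is homogeneous and isotropic,
and, in addition, that $\varepsilon$ and $\mu$ do not depend on E and H.

Equations (71.7)-(71.10) have non-zero solutions. Consequently,
electromagnetic fields can also exist in the absence of charges. Such
electromagnetic fields are known as electromagnetic waves.

An equation of the kind

$$\Delta f - \frac{1}{v^2}\,\frac{\partial^2 f}{\partial t^2} = 0 \tag{71.11}$$

where $v$ is a constant, is called a wave equation. It is known from
the general course of physics that $v$ is the phase velocity of a wave.
Consequently, the velocity of electromagnetic waves is

$$v = \frac{c}{\sqrt{\varepsilon\mu}} = \frac{c}{n} \tag{71.12}$$

where

$$n = \sqrt{\varepsilon\mu} \tag{71.13}$$

is the refractive index of the medium in which a wave propagates.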

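It is a simple matter to obtain the wave equation in the
four-dimensional form. Let us do this for the field in a vacuum. By (64.2),
Maxwell's equations in the absence of charges and currents are

$$\sum_{\nu}\frac{\partial F^{\mu\nu}}{\partial x^\nu} = 0 \quad (\mu = 0, 1, 2, 3) \tag{71.14}$$

The substitution for $F^{\mu\nu}$ of their values from (61.6) yields

$$\sum_{\nu}\frac{\partial}{\partial x^\nu}\Big(\frac{\partial A^\nu}{\partial x_\mu} - \frac{\partial A^\mu}{\partial x_\nu}\Big) = 0$$

or

$$\sum_{\nu}\frac{\partial^2 A^\nu}{\partial x_\mu\,\partial x^\nu} - \sum_{\nu}\frac{\partial^2 A^\mu}{\partial x_\nu\,\partial x^\nu} = 0 \tag{71.15}$$

The first sum can be written as follows:

$$\sum_{\nu}\frac{\partial^2 A^\nu}{\partial x_\mu\,\partial x^\nu} = \frac{\partial}{\partial x_\mu}\sum_{\nu}\frac{\partial A^\nu}{\partial x^\nu}$$

If the four-potential satisfies the Lorentz condition (60.18), then
$\sum_\nu(\partial A^\nu/\partial x^\nu) = 0$, and the first term in Eq. (71.15) will vanish.
We thus arrive at the equations

$$\sum_{\nu}\frac{\partial^2 A^\mu}{\partial x_\nu\,\partial x^\nu} = 0 \quad (\mu = 0, 1, 2, 3) \tag{71.16}$$

Raising of the index on $\partial x_\nu$ is equivalent to multiplying each
of the addends in (71.16) by $g^{\nu\nu}$ [see (XII.30)]. Therefore, Eq. (71.16)
can be given the form

$$\sum_{\nu} g^{\nu\nu}\,\frac{\partial^2 A^\mu}{\partial x^\nu\,\partial x^\nu} = 0$$

Finally, taking into account that the non-diagonal components
of the tensor $g^{\nu\rho}$ are zeros, we can write

$$\sum_{\nu,\rho} g^{\nu\rho}\,\frac{\partial^2 A^\mu}{\partial x^\nu\,\partial x^\rho} = 0 \quad (\mu = 0, 1, 2, 3) \tag{71.17}$$

Equations (71.16) and (71.17) are wave equations in the
four-dimensional form. Substitution of values for $g^{\nu\rho}$ and $x^\nu$ in them
results, as can readily be seen, in Eqs. (71.7) and (71.8).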


72. A Plane Electromagnetic Wave in a Homogeneous 
and Isotropic Medium 

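The solution of the wave equation is considerably facilitated
if the field depends on only one coordinate, say $x$. The wave in this
case is called plane. Using a plane wave as an example, we can
determine all the features of electromagnetic waves. For these
reasons, we shall treat only plane waves in this chapter.

Let $f$ stand for any of the components of the vector potential A
or the scalar potential $\varphi$ (with equal reason, we can understand $f$
to signify any component of the vector E or the vector B). For a plane
wave, the function $f$ depends only on $x$ and $t$ and, consequently,
is a solution of the equation

$$\frac{\partial^2 f}{\partial x^2} - \frac{\varepsilon\mu}{c^2}\,\frac{\partial^2 f}{\partial t^2} = 0 \tag{72.1}$$

This equation can be written as

$$Df = 0 \tag{72.2}$$

where $D$ is a differential operator determined by the formula

$$D = \frac{\partial^2}{\partial x^2} - \frac{\varepsilon\mu}{c^2}\,\frac{\partial^2}{\partial t^2}$$

We shall represent this operator as

$$D = \Big(\frac{\partial}{\partial x} - \frac{n}{c}\,\frac{\partial}{\partial t}\Big)\Big(\frac{\partial}{\partial x} + \frac{n}{c}\,\frac{\partial}{\partial t}\Big) \tag{72.3}$$

where $n = \sqrt{\varepsilon\mu}$ [see (71.13)].

Let us introduce the new variables

$$\xi = t - \frac{x}{c/n}, \qquad \eta = t + \frac{x}{c/n} \tag{72.4}$$

i.e. replace $x$ and $t$ by the formulas

$$x = \frac{\eta - \xi}{2}\,\frac{c}{n}, \qquad t = \frac{\eta + \xi}{2} \tag{72.5}$$

By (72.5), we have

$$\frac{\partial}{\partial\xi} = \frac{\partial}{\partial x}\,\frac{\partial x}{\partial\xi} + \frac{\partial}{\partial t}\,\frac{\partial t}{\partial\xi} = -\frac{c}{2n}\,\frac{\partial}{\partial x} + \frac{1}{2}\,\frac{\partial}{\partial t} = -\frac{c}{2n}\Big(\frac{\partial}{\partial x} - \frac{n}{c}\,\frac{\partial}{\partial t}\Big)$$

Consequently, in going over to the variables $\xi$ and $\eta$, the first factor
in (72.3) must be replaced with $-(2n/c)\,\partial/\partial\xi$.

Similarly,

$$\frac{\partial}{\partial\eta} = \frac{\partial}{\partial x}\,\frac{\partial x}{\partial\eta} + \frac{\partial}{\partial t}\,\frac{\partial t}{\partial\eta} = \frac{c}{2n}\,\frac{\partial}{\partial x} + \frac{1}{2}\,\frac{\partial}{\partial t} = \frac{c}{2n}\Big(\frac{\partial}{\partial x} + \frac{n}{c}\,\frac{\partial}{\partial t}\Big)$$

so that the second factor in (72.3) must be replaced with $(2n/c)\,\partial/\partial\eta$.
The result is

$$D = -\frac{4n^2}{c^2}\,\frac{\partial}{\partial\xi}\,\frac{\partial}{\partial\eta} = -\frac{4n^2}{c^2}\,\frac{\partial^2}{\partial\xi\,\partial\eta}$$

Let us introduce the found value of $D$ into Eq. (72.2), discarding
the factor $-4n^2/c^2$. The result is the differential equation

$$\frac{\partial^2 f}{\partial\xi\,\partial\eta} = 0 \tag{72.6}$$

An obvious solution of this equation is a function depending
on only one variable $\xi$ or $\eta$, i.e. the function $f_1(\xi)$ or $f_2(\eta)$. Summing
the functions $f_1$ and $f_2$, we get a general solution of Eq. (72.6):

$$f(\xi, \eta) = f_1(\xi) + f_2(\eta) \tag{72.7}$$

We did not multiply $f_1$ by $C_1$ and $f_2$ by $C_2$ because $f_1$ and $f_2$ are
arbitrary functions of the relevant variables.

Introducing into (72.7) expressions (72.4) for $\xi$ and $\eta$, we arrive
at a solution of Eq. (72.1):

$$f(x, t) = f_1\Big(t - \frac{x}{c/n}\Big) + f_2\Big(t + \frac{x}{c/n}\Big) \tag{72.8}$$

The first term in this expression is a wave running at the velocity
$c/n$ in the direction of the $x$-axis. Indeed, the values of $f_1$ are the
same for all the values of $t$ and $x$ related by the expression

$$t - \frac{x}{c/n} = \text{const} \quad\text{or}\quad x = \frac{c}{n}\,t - \frac{c}{n}\,\text{const}$$

whence it follows that any preset value of the function $f_1$ travels
along the $x$-axis at the velocity $c/n$.

Similarly, the second term in expression (72.8) is a wave travelling
at the velocity $c/n$ in a direction opposite to the $x$-axis.

The shape of the wave (72.8), i.e. the form of the functions $f_1$
and $f_2$, is absolutely arbitrary.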
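That a sum of two arbitrary profiles of the form (72.8) satisfies Eq. (72.1) can be checked by finite differences. The sketch below is only an illustration under stated assumptions: the profiles (a Gaussian pulse and a sine), the step size and the units are arbitrary choices, not taken from the text.

```python
import math


def travelling_wave(f1, f2, n, c):
    """Return f(x, t) = f1(t - x n / c) + f2(t + x n / c), formula (72.8)."""
    def f(x, t):
        return f1(t - n * x / c) + f2(t + n * x / c)
    return f


def wave_residual(f, x, t, n, c, h=1e-3):
    """Central-difference estimate of d2f/dx2 - (n^2/c^2) d2f/dt2 at the point (x, t)."""
    d2x = (f(x + h, t) - 2.0 * f(x, t) + f(x - h, t)) / h ** 2
    d2t = (f(x, t + h) - 2.0 * f(x, t) + f(x, t - h)) / h ** 2
    return d2x - (n / c) ** 2 * d2t


if __name__ == "__main__":
    c, n = 1.0, 1.5                          # arbitrary units; n = sqrt(eps * mu)
    pulse = lambda s: math.exp(-s * s)       # hypothetical profile f1
    ripple = lambda s: math.sin(2.0 * s)     # hypothetical profile f2
    f = travelling_wave(pulse, ripple, n, c)
    print("residual for (72.8):", wave_residual(f, 0.3, 0.7, n, c))
    # A profile moving at the wrong speed (c instead of c/n) does not satisfy (72.1).
    g = lambda x, t: pulse(t - x / c)
    print("residual for wrong speed:", wave_residual(g, 0.3, 0.7, n, c))
```

The first residual should be small, limited only by the discretization error, whereas the second stays of order unity.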

Consider a plane wave propagating in the direction of the $x$-axis.
Let us choose the potentials so that

$$\varphi = 0, \qquad \nabla\mathbf{A} = 0 \tag{72.9}$$

Such a gauging of the potentials satisfies the Lorentz condition
(71.1) and, consequently, does not affect the values of the vectors B
and E (see Sec. 56). When we apply condition (72.9) to the
potential $\varphi$, we obtain

$$\mathbf{E} = -\frac{1}{c}\,\frac{\partial\mathbf{A}}{\partial t} \tag{72.10}$$

[see (56.3)].

For the wave being considered, all the quantities characterizing
a field including A have the form

$$\mathbf{A} = \mathbf{A}(\xi) = \mathbf{A}\Big(t - \frac{x}{c/n}\Big) \tag{72.11}$$

so that

$$\frac{\partial A_i}{\partial y} = 0 \quad\text{and}\quad \frac{\partial A_i}{\partial z} = 0 \quad (i = x, y, z)$$

Particularly, $\partial A_y/\partial y$ and $\partial A_z/\partial z$ vanish. Hence, the second of
conditions (72.9) in the case being considered becomes

$$\frac{\partial A_x}{\partial x} = 0$$

Introduction of the last relation into Eq. (72.1) written for $A_x$
yields

$$\frac{\partial^2 A_x}{\partial t^2} = 0$$

According to the last equation

$$\frac{\partial A_x}{\partial t} = \text{const} \tag{72.12}$$

The derivative $\partial A_x/\partial t$ determines the field component $E_x$ [see
(72.10)]. Relation (72.12) therefore signifies that $E_x = \text{const}$. We
have thus arrived at the conclusion that a non-zero component $E_x$
can be due only to a constant and homogeneous electric field. Such
a field has no relation to an electromagnetic wave. Consequently,
we can consider that $A_x$ and, therefore, $E_x$ too, vanishes.

Examination of (72.11) shows that

$$\frac{\partial\mathbf{A}}{\partial t} = \frac{d\mathbf{A}}{d\xi}, \qquad \frac{\partial\mathbf{A}}{\partial x} = \frac{d\mathbf{A}}{d\xi}\,\frac{\partial\xi}{\partial x} = -\frac{n}{c}\,\frac{d\mathbf{A}}{d\xi} \tag{72.13}$$

According to the first of these formulas

$$\mathbf{E} = -\frac{1}{c}\,\frac{\partial\mathbf{A}}{\partial t} = -\frac{1}{c}\,\frac{d\mathbf{A}}{d\xi} \tag{72.14}$$

Since A depends only on the single coordinate $x$, in the expression
$\mathbf{B} = [\nabla\mathbf{A}]$ we must retain only the component of $\nabla$ equal to
$\mathbf{e}_x(\partial/\partial x)$. We can therefore write that

$$\mathbf{B} = \Big[\mathbf{e}_x\frac{\partial}{\partial x},\ \mathbf{A}\Big] = \Big[\mathbf{e}_x,\ \frac{\partial\mathbf{A}}{\partial x}\Big]$$

or, according to the second of formulas (72.13),

$$\mathbf{B} = \Big[\mathbf{e}_x,\ \Big(-\frac{n}{c}\,\frac{d\mathbf{A}}{d\xi}\Big)\Big]$$

Finally, taking into account (72.14), we obtain

$$\mathbf{B} = n\,[\mathbf{e}_x, \mathbf{E}] = \sqrt{\varepsilon\mu}\,[\mathbf{e}_x, \mathbf{E}] \tag{72.15}$$

whence it follows that the vector B is perpendicular both to the
vector E and to the $x$-axis. We showed above that for a wave $E_x = 0$
and, consequently, the vector E is perpendicular to the $x$-axis.
We thus arrive at the conclusion that in a plane wave the vectors E
and B are perpendicular to the direction of propagation of the wave.
Consequently, electromagnetic waves are transverse.

In formula (72.15), $\mathbf{e}_x$ is the unit vector of the direction in which
the wave propagates. Designating this unit vector by the symbol
$\mathbf{k}_0$, we arrive at the formula

$$\mathbf{B} = \sqrt{\varepsilon\mu}\,[\mathbf{k}_0, \mathbf{E}] \tag{72.16}$$

that does not depend on our choice of the directions of the
coordinate axes.

Substitution of $\mu\mathbf{H}$ for B yields

$$\sqrt{\mu}\,\mathbf{H} = [\mathbf{k}_0,\ \sqrt{\varepsilon}\,\mathbf{E}] \tag{72.17}$$

It can be seen from (72.17) that the vectors H and E are mutually
perpendicular, and their magnitudes are related by the expression

$$\sqrt{\mu}\,H = \sqrt{\varepsilon}\,E \tag{72.18}$$

In addition, it is easy to conclude from (72.17) that the vectors E,
H, and $\mathbf{k}_0$ form a right-handed sequence.

An electromagnetic wave carries an energy whose flux is
determined by the Poynting vector

$$\mathbf{S} = \frac{c}{4\pi}\,[\mathbf{E}\mathbf{H}]$$

Let us express H in terms of E in accordance with (72.17) and use
formula (VI.5). Hence

$$\mathbf{S} = \frac{c}{4\pi}\sqrt{\frac{\varepsilon}{\mu}}\,[\mathbf{E}, [\mathbf{k}_0, \mathbf{E}]] = \frac{c}{4\pi}\sqrt{\frac{\varepsilon}{\mu}}\,\{\mathbf{k}_0 E^2 - \mathbf{E}\,(\mathbf{E}\mathbf{k}_0)\}$$

Since the vectors E and $\mathbf{k}_0$ are mutually perpendicular, $\mathbf{E}\mathbf{k}_0 = 0$
so that

$$\mathbf{S} = \frac{c}{4\pi}\sqrt{\frac{\varepsilon}{\mu}}\,E^2\,\mathbf{k}_0 = \frac{c}{4\pi\sqrt{\varepsilon\mu}}\,\varepsilon E^2\,\mathbf{k}_0$$

Using relation (72.18), we can write

$$\frac{c}{4\pi\sqrt{\varepsilon\mu}}\,\varepsilon E^2\,\mathbf{k}_0 = \frac{c}{4\pi\sqrt{\varepsilon\mu}}\,\mu H^2\,\mathbf{k}_0 = \frac{c}{\sqrt{\varepsilon\mu}}\,\frac{\varepsilon E^2 + \mu H^2}{8\pi}\,\mathbf{k}_0$$

Finally, with a view to $(\varepsilon E^2 + \mu H^2)/8\pi$ being the wave energy
density $w$, we obtain

$$\mathbf{S} = \frac{c}{\sqrt{\varepsilon\mu}}\,w\,\mathbf{k}_0 = v\,w\,\mathbf{k}_0 \tag{72.19}$$

In a vacuum

$$\mathbf{S} = c\,w\,\mathbf{k}_0 \tag{72.20}$$

Expression (72.19) agrees with the relation known from the general
course of physics, according to which the density of the energy
flux carried by a wave equals the density of the energy multiplied
by the velocity of propagation of the wave. The direction of the
energy flux density vector coincides with that of wave propagation.

The momentum flux is determined by Maxwell's stress tensor whose
components are calculated by the formula

$$\sigma_{ik} = \frac{1}{4\pi}\Big\{E_iD_k + H_iB_k - \frac{1}{2}\,(\mathbf{E}\mathbf{D} + \mathbf{H}\mathbf{B})\,\delta_{ik}\Big\}$$

[see (59.12)]. Let us direct the $x$-axis along $\mathbf{k}_0$, the $y$-axis along E,
and the $z$-axis along H (the vectors E, H, and $\mathbf{k}_0$ will form a
right-handed sequence). Consequently, $E_y = \pm E$, $E_x = E_z = 0$,
$D_y = \pm D$, $D_x = D_z = 0$, $H_z = \pm H$, $H_x = H_y = 0$, $B_z = \pm B$,
$B_x = B_y = 0$. It can be seen that with such a choice of the axes all
the non-diagonal components of the tensor $\sigma_{ik}$ vanish. Since $E_xD_x$
and $H_xB_x$ equal zero, we obtain that

$$\sigma_{xx} = -\frac{ED + HB}{8\pi} = -w$$

Since $E_yD_y = ED$, and $H_yB_y = 0$, we have

$$\sigma_{yy} = \frac{1}{4\pi}\Big\{ED - \frac{1}{2}\,(ED + HB)\Big\} = 0$$

[by (72.18), $ED = HB$]. Similarly

$$\sigma_{zz} = \frac{1}{4\pi}\Big\{HB - \frac{1}{2}\,(ED + HB)\Big\} = 0$$

Hence, with our choice of the coordinate axes, only one component
of Maxwell's stress tensor is non-zero, namely the component $\sigma_{xx}$, and
it equals the wave energy density $w$ taken with the opposite sign.
We remind our reader that in accordance with the customary
practice in the overwhelming majority of manuals on electrodynamics,
we defined the tensor $\sigma_{ik}$ to characterize the flux of the momentum
not flowing out of a given volume, but flowing into it (see Sec. 59).
This is equivalent to reversing the direction of a normal to an area.
Hence, the momentum flux carried in the direction of the $x$-axis
through the area $df$ perpendicular to this axis is determined by
a positive quantity, namely, $\sigma_{xx}(-df) = w\,df$.
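The relations of this section can be tried out on concrete numbers. The following sketch, in Gaussian units with illustrative values of $\varepsilon$, $\mu$ and E (all of them assumptions made for the example), computes H from (72.17), the Poynting vector, the energy density and the stress tensor in the form quoted above.

```python
import math


def cross(a, b):
    """Vector product [a b] of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Scalar product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def magnetic_field(E, k0, eps, mu):
    """H from (72.17): sqrt(mu) H = sqrt(eps) [k0, E]."""
    factor = math.sqrt(eps / mu)
    return tuple(factor * x for x in cross(k0, E))


def poynting(E, H, c):
    """S = (c / 4 pi) [E H]."""
    return tuple(c / (4 * math.pi) * x for x in cross(E, H))


def energy_density(E, H, eps, mu):
    """w = (eps E^2 + mu H^2) / (8 pi)."""
    return (eps * dot(E, E) + mu * dot(H, H)) / (8 * math.pi)


def stress_tensor(E, H, eps, mu):
    """sigma_ik = (1/4pi) {E_i D_k + H_i B_k - (1/2)(ED + HB) delta_ik}."""
    D = [eps * x for x in E]
    B = [mu * x for x in H]
    half = 0.5 * (dot(E, D) + dot(H, B))
    return [[(E[i] * D[k] + H[i] * B[k] - (half if i == k else 0.0)) / (4 * math.pi)
             for k in range(3)] for i in range(3)]


if __name__ == "__main__":
    eps, mu, c = 2.25, 1.0, 1.0      # illustrative medium, units with c = 1
    k0 = (1.0, 0.0, 0.0)             # propagation along the x-axis
    E = (0.0, 2.0, 0.0)              # E along the y-axis
    H = magnetic_field(E, k0, eps, mu)
    w = energy_density(E, H, eps, mu)
    print("H =", H)
    print("S =", poynting(E, H, c), " v*w =", c / math.sqrt(eps * mu) * w)
    print("sigma =", stress_tensor(E, H, eps, mu), " -w =", -w)
```

With these values H points along the z-axis, the x-component of S equals $vw$ as in (72.19), and the only non-zero component of the stress tensor is $\sigma_{xx} = -w$.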

73. A Monochromatic Plane Wave 

A wave in which the field at each point varies with time according
to a harmonic law (i.e. according to a cosine law) is called
monochromatic.

For a monochromatic wave, the solution of Eq. (71.11) is

$$f = a\cos(\omega t + \psi_1) \tag{73.1}$$

where $a$ is a quantity not depending on $t$, and $\psi_1$ is a function of r.
In this case, $\partial^2 f/\partial t^2 = -\omega^2 f$. Substitution of this value into (71.11)
yields the equation

$$\Delta f + \frac{\omega^2}{v^2}\,f = 0 \tag{73.2}$$

that determines how $f$ depends on r.

Let us introduce the quantity

$$\mathbf{k} = \frac{\omega}{v}\,\mathbf{k}_0 = \frac{n\omega}{c}\,\mathbf{k}_0 = \frac{\omega\sqrt{\varepsilon\mu}}{c}\,\mathbf{k}_0 \tag{73.3}$$

known as the wave vector ($\mathbf{k}_0$ is the unit vector of the direction in
which the wave is propagating). Equation (73.2) can now be written
as follows:

$$\Delta f + k^2 f = 0 \tag{73.4}$$

In the following, we shall limit ourselves to a treatment of a plane
monochromatic wave. Let us choose the $x$-axis in the direction of
wave propagation. Equation (73.4) therefore becomes

$$\frac{\partial^2 f}{\partial x^2} + k^2 f = 0 \tag{73.5}$$

The following function will be a solution of this equation:

$$f = a\cos(\pm kx + \psi_2) \tag{73.6}$$

where $a$ is a quantity not depending on $x$, and $\psi_2$ is a function of $t$.

Expressions (73.1) and (73.6) can be brought into agreement,
assuming that

$$f = a\cos(\omega t \pm kx + \alpha) \tag{73.7}$$

where $a$ and $\alpha$ are quantities depending neither on $t$ nor on $x$. The
different signs of $kx$ correspond to different directions of wave
propagation. We shall consider waves running in the direction of
increasing $x$, and in this connection we shall write a minus sign in front
of $kx$.

With an arbitrary choice of the coordinate axes, formula (73.7)
becomes

$$f = a\cos(\omega t - \mathbf{k}\mathbf{r} + \alpha) \tag{73.8}$$

Any quantity characterizing a monochromatic plane wave changes
according to this law, particularly the vector potential A. We thus
have

$$\mathbf{A} = \mathbf{A}_0\cos(\omega t - \mathbf{k}\mathbf{r} + \alpha) \tag{73.9}$$

Here $\mathbf{A}_0$ is the amplitude, $\omega$ is the frequency, k is the wave vector,
and $\alpha$ is the initial phase of the wave.

The expression $(\omega t - \mathbf{k}\mathbf{r} + \alpha)$, called the phase of a wave, is an
invariant. Indeed, the fact that the field at a given point of space
at a given instant acquired, for example, a zero value
[$\cos(\omega t - \mathbf{k}\mathbf{r} + \alpha) = 0$] cannot depend on the choice of the reference frame.
It thus follows that for two arbitrarily chosen reference frames, the
condition

$$\omega t - \mathbf{k}\mathbf{r} + \alpha = \omega' t' - \mathbf{k}'\mathbf{r}' + \alpha'$$

must be satisfied.

If we assume that $t = 0$ and $\mathbf{r} = 0$, it can be seen from the Lorentz
transformations that $t'$ and $\mathbf{r}'$ also equal zero. Consequently, we
must have $\alpha = \alpha' = \text{inv}$, from which we conclude that the quantity

$$\Phi = \omega t - \mathbf{k}\mathbf{r} \tag{73.10}$$

also called the phase, is invariant.

For an electromagnetic wave propagating in a vacuum, the phase
(73.10) can be written as

$$\Phi = \frac{\omega}{c}\,(ct) - \mathbf{k}\mathbf{r} = \frac{\omega}{c}\,x^0 - \mathbf{k}\mathbf{r} \tag{73.11}$$

Since $\Phi$ is an invariant, it follows from (73.11) that $\omega/c$ and k form
a four-dimensional wave vector

$$k^\mu = \Big(\frac{\omega}{c},\ \mathbf{k}\Big) \tag{73.12}$$

This circumstance makes it possible to find the law of
transformation of the wave frequency $\omega$ in passing from one inertial
reference frame to another. According to the first of formulas (36.1),

$$k'^0 = \frac{k^0 - \beta k^1}{\sqrt{1 - \beta^2}}$$

whence after introducing the values $k^0 = \omega/c$, $k^1 = k_x = (\omega/c)\cos\vartheta$
($\vartheta$ is the angle between the direction of wave propagation
and the $x$-axis), and $\beta = v/c$ ($v$ is the velocity of the frame
$K'$ relative to the frame $K$), we find that

$$\omega' = \frac{\omega\,[1 - (v/c)\cos\vartheta]}{\sqrt{1 - v^2/c^2}}$$

This formula for $\vartheta = 0$ yields a formula for the longitudinal
Doppler effect:

$$\omega' = \frac{\omega\,(1 - v/c)}{\sqrt{1 - v^2/c^2}} = \omega\sqrt{\frac{1 - v/c}{1 + v/c}}$$

When $\vartheta = \pi/2$, we arrive at a formula for the transverse Doppler
effect:

$$\omega' = \frac{\omega}{\sqrt{1 - v^2/c^2}}$$

We must note that since the magnitude of the wave vector is
$\omega/c$, the square of the four-vector $k^\mu$ vanishes:

$$\sum_{\mu=0}^{3} k^\mu k_\mu = (k^0)^2 - \mathbf{k}^2 = 0$$
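The Doppler formulas can be reproduced by transforming the components of the four-dimensional wave vector directly. The sketch below assumes a boost along the x-axis with the standard Lorentz transformation of a four-vector and a wave in a vacuum; the function names and numerical values are illustrative.

```python
import math


def boost_wave_vector(omega, k, v, c):
    """Transform k^mu = (omega/c, k) to a frame K' moving with velocity v along x.

    Returns the frequency and the wave vector as measured in K'.
    """
    beta = v / c
    gamma = 1.0 / math.sqrt(1.0 - beta ** 2)
    k0 = omega / c
    k0_prime = gamma * (k0 - beta * k[0])
    k1_prime = gamma * (k[0] - beta * k0)
    return c * k0_prime, (k1_prime, k[1], k[2])


def doppler(omega, theta, v, c):
    """omega' = omega [1 - (v/c) cos(theta)] / sqrt(1 - v^2/c^2)."""
    return omega * (1.0 - (v / c) * math.cos(theta)) / math.sqrt(1.0 - (v / c) ** 2)


if __name__ == "__main__":
    omega, c, v = 2.0, 1.0, 0.6
    for theta in (0.0, 0.7, math.pi / 2):
        k = (omega / c * math.cos(theta), omega / c * math.sin(theta), 0.0)
        omega_prime, k_prime = boost_wave_vector(omega, k, v, c)
        norm = (omega_prime / c) ** 2 - sum(x * x for x in k_prime)
        print(theta, omega_prime, doppler(omega, theta, v, c), norm)
```

For each angle the transformed frequency should coincide with the closed formula, and the square of the four-vector should remain zero in the new frame.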

Let us again turn to expression (73.9). It can be written as

$$\mathbf{A} = \text{Re}\,\{\hat{\mathbf{A}}_0\,e^{i(\omega t - \mathbf{k}\mathbf{r})}\} = \text{Re}\,\{\hat{\mathbf{A}}\} \tag{73.13}$$

where $\hat{\mathbf{A}}_0$ is a constant complex vector:

$$\hat{\mathbf{A}}_0 = \mathbf{A}_0\,e^{i\alpha} \tag{73.14}$$

and $\hat{\mathbf{A}}$ is the complex vector in the braces [compare with (16.9) and
(16.10)].

A change in the sign of the exponent in an expression such as $e^{i\varphi}$
does not change the real part of this expression. Formula (73.13)
can therefore be written as follows:

$$\mathbf{A} = \text{Re}\,\{\hat{\mathbf{A}}_0\,e^{i(\mathbf{k}\mathbf{r} - \omega t)}\} = \text{Re}\,\{\hat{\mathbf{A}}\} \tag{73.15}$$

In this case by the phase of the wave we must understand the
expression differing from (73.10) in its sign. In addition, the sign ought
to be changed in the exponent of the quantity (73.14). But owing
to the arbitrary nature of $\alpha$, this need not be done.

Expression (73.15) is in some respects more convenient than
expression (73.13) and, in addition, is more similar to the wave
function describing the motion of a free particle in quantum
mechanics than (73.13).

It is a simple matter to see that in performing linear operations
on the quantity A determined by formula (73.15), for instance
differentiation, we can perform these operations on the complex
vector $\hat{\mathbf{A}}$ and then take the real part of the obtained quantity. For
example,

$$\frac{\partial\mathbf{A}}{\partial t} = \text{Re}\,\Big\{\frac{\partial\hat{\mathbf{A}}}{\partial t}\Big\} = \text{Re}\,\{-i\omega\hat{\mathbf{A}}\} \tag{73.16}$$
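The rule expressed by (73.16) is easy to test for a single component of the potential in one dimension. In the sketch below the complex amplitude is taken as $A_0 e^{-i\alpha}$, i.e. with the sign of the exponent changed as discussed above, so that the real part reproduces (73.9) exactly; this choice, the function names and the sample numbers are assumptions made for the illustration.

```python
import cmath
import math


def real_potential(A0, alpha, k, omega, x, t):
    """One component of (73.9) in one dimension: A0 cos(omega t - k x + alpha)."""
    return A0 * math.cos(omega * t - k * x + alpha)


def complex_potential(A0, alpha, k, omega, x, t):
    """Complex quantity of the form used in (73.15), with amplitude A0 exp(-i alpha)."""
    return A0 * cmath.exp(-1j * alpha) * cmath.exp(1j * (k * x - omega * t))


def time_derivative_complex(A0, alpha, k, omega, x, t):
    """Re{-i omega A_hat}, the right-hand side of (73.16)."""
    return (-1j * omega * complex_potential(A0, alpha, k, omega, x, t)).real


def time_derivative_numeric(A0, alpha, k, omega, x, t, h=1e-6):
    """Central-difference derivative of the real potential with respect to t."""
    return (real_potential(A0, alpha, k, omega, x, t + h)
            - real_potential(A0, alpha, k, omega, x, t - h)) / (2.0 * h)


if __name__ == "__main__":
    args = (1.3, 0.4, 2.0, 5.0, 0.25, 0.1)   # A0, alpha, k, omega, x, t (illustrative)
    print(real_potential(*args), complex_potential(*args).real)
    print(time_derivative_numeric(*args), time_derivative_complex(*args))
```

Both printed pairs should agree: the real part of the complex expression gives the potential itself, and multiplying by $-i\omega$ before taking the real part gives its time derivative.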

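Let us calculate the curl of the vector A. This is a linear operation,
hence

$$[\nabla\mathbf{A}] = \text{Re}\,\{[\nabla\hat{\mathbf{A}}]\} = \text{Re}\,\{[\nabla,\ \hat{\mathbf{A}}_0\,e^{i(\mathbf{k}\mathbf{r} - \omega t)}]\} = \text{Re}\,\{[\nabla e^{i(\mathbf{k}\mathbf{r} - \omega t)},\ \hat{\mathbf{A}}_0]\} \tag{73.17}$$

Since $\mathbf{k}\mathbf{r} = \sum_j k_jx_j$, we have

$$\frac{\partial}{\partial x_j}\,e^{i(\mathbf{k}\mathbf{r} - \omega t)} = ik_j\,e^{i(\mathbf{k}\mathbf{r} - \omega t)}$$

Hence,

$$\nabla e^{i(\mathbf{k}\mathbf{r} - \omega t)} = i\,e^{i(\mathbf{k}\mathbf{r} - \omega t)}\sum_j k_j\mathbf{e}_j = i\mathbf{k}\,e^{i(\mathbf{k}\mathbf{r} - \omega t)}$$

Introducing this value of the gradient into (73.17), we obtain

$$[\nabla\mathbf{A}] = \text{Re}\,\{[i\mathbf{k}\,e^{i(\mathbf{k}\mathbf{r} - \omega t)},\ \hat{\mathbf{A}}_0]\} = \text{Re}\,\{[i\mathbf{k},\ \hat{\mathbf{A}}_0\,e^{i(\mathbf{k}\mathbf{r} - \omega t)}]\} = \text{Re}\,\{i\,[\mathbf{k}\hat{\mathbf{A}}]\} \tag{73.18}$$

Now we can write expressions for the fields E and B:

$$\mathbf{E} = -\frac{1}{c}\,\frac{\partial\mathbf{A}}{\partial t} = \text{Re}\,\Big\{i\,\frac{\omega}{c}\,\hat{\mathbf{A}}\Big\} \tag{73.19}$$

$$\mathbf{B} = [\nabla\mathbf{A}] = \text{Re}\,\{i\,[\mathbf{k}\hat{\mathbf{A}}]\} \tag{73.20}$$

[if we had proceeded from expression (73.13), a minus sign would
have appeared in the last two formulas]. A comparison of them shows
that the vectors B and E oscillate in the same phase.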

Having in view that k = (to Y efi/c) k 0 , formula (73.20) can be 
written as 

B'=Re ji [^]/ep-^- k„, A.J} = Re |v"ep [ k„, 

The' second factor in the last vector product is E [see (73.19)]. Hence, 
for the vector B whose real part gives B, we obtain the relation 

B = ]/ep.[k 0 , E] (73.21) 

Introducing the notation i (a/c) A 0 = E 0 , we can write expres- 
sion (73.19) as follows: 

E = Re {E 0 e» (73.22) 

If we direct the x-axis along the vector k, the vector E and,
consequently, $\hat{\mathbf{E}}_0$ will be in the plane yz. It can therefore be written as
a linear combination of the unit vectors $\mathbf{e}_y$ and $\mathbf{e}_z$:

$$\hat{\mathbf{E}}_0 = \xi\,\mathbf{e}_y + \eta\,\mathbf{e}_z$$

where $\xi$ and $\eta$ are complex numbers.

We shall show that upon the proper choice of α, the last
expression can be transformed as follows:

$$\hat{\mathbf{E}}_0 = (E_{y0}\mathbf{e}_y + iE_{z0}\mathbf{e}_z)\, e^{-i\alpha} \tag{73.23}$$

where $E_{y0}$ and $E_{z0}$ are real quantities. To do this, we shall write
the following expression for the square of the vector $\hat{\mathbf{E}}_0$ which,
generally speaking, is a complex number:

$$\hat{\mathbf{E}}_0^2 = \text{real number} \times e^{-2i\alpha}$$

Let us represent the real number as the square of a vector
quantity. Not only a real vector, but also the complex vector a ± ib
may be this quantity provided that a and b are mutually
perpendicular. Indeed, when ab = 0,

$$(\mathbf{a} + i\mathbf{b})^2 = a^2 + 2i\,\mathbf{ab} - b^2 = a^2 - b^2 = \text{real number}$$

We have thus obtained the expression

$$\hat{\mathbf{E}}_0^2 = (\mathbf{a} + i\mathbf{b})^2 e^{-2i\alpha},$$

whence

$$\hat{\mathbf{E}}_0 = (\mathbf{a} + i\mathbf{b})\, e^{-i\alpha} \qquad (\mathbf{a} \perp \mathbf{b})$$

Directing the y-axis along the vector a, and the z-axis along the
vector b, we arrive at formula (73.23).
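
The construction just described can be tried numerically. The sketch below is only an illustration: the function name `split_amplitude` and the sample amplitude are assumptions, not part of the text. It picks α so that the square of the complex amplitude times exp(2iα) is real and non-negative, and then reads off the real vectors a and b.

```python
import cmath

def split_amplitude(e0):
    """Write a complex 2-vector e0 as (a + i b) * exp(-i alpha) with real a, b and a.b = 0."""
    square = sum(c * c for c in e0)            # complex number E0^2 (no conjugation)
    alpha = -0.5 * cmath.phase(square)         # makes E0^2 * exp(2 i alpha) real and >= 0
    rotated = [c * cmath.exp(1j * alpha) for c in e0]   # this is a + i b
    a = [c.real for c in rotated]
    b = [c.imag for c in rotated]
    return alpha, a, b

# Arbitrary sample amplitude (xi, eta) in the yz plane; the numbers are hypothetical.
e0 = [1.0 + 2.0j, 0.5 - 1.5j]
alpha, a, b = split_amplitude(e0)

dot_ab = a[0] * b[0] + a[1] * b[1]             # should vanish: a is perpendicular to b
rebuilt = [(a[i] + 1j * b[i]) * cmath.exp(-1j * alpha) for i in range(2)]
```

The imaginary part of (a + ib)² is 2ab, so forcing the square to be real is exactly what makes a and b perpendicular.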

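Let us introduce expression (73.23) into formula (73.22):

$$\mathbf{E} = \operatorname{Re}\{(E_{y0}\mathbf{e}_y + iE_{z0}\mathbf{e}_z)\, e^{-i(\omega t - kx + \alpha)}\}$$

(with the directions of the axes we have chosen, kr = kx). Hence

$$E_y = E_{y0}\cos(\omega t - kx + \alpha), \qquad E_z = E_{z0}\sin(\omega t - kx + \alpha) \tag{73.24}$$

Inspection of these formulas shows that

$$\frac{E_y^2}{E_{y0}^2} + \frac{E_z^2}{E_{z0}^2} = 1 \tag{73.25}$$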

All our reasoning and all formulas beginning with (73.22) also 
hold for the vector B. 


Our result signifies that the vector E rotates in a plane
perpendicular to the direction of propagation of the wave, its tip describing
an ellipse (the vector B behaves similarly). The direction of rotation
depends on what signs (identical or opposite) $E_{y0}$ and $E_{z0}$ have in
formulas (73.24). Such a wave is called elliptically polarized. At
$E_{y0} = \pm E_{z0}$, the ellipse (73.25) transforms into a circle. In this
case, the wave is circularly polarized. Finally, one of the quantities
$E_{y0}$ and $E_{z0}$ may be zero. Here, the vector E (and also B) is
constantly directed along the same straight line. The wave in this
case is called linearly polarized or plane polarized.

A monochromatic wave is thus polarized (elliptically, circularly, 
or linearly) in all cases. This signifies that the oscillations in the 
wave are ordered in some way or other. 
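
As a small numerical illustration of formulas (73.24) and (73.25), the sketch below samples the field over one period and checks that its tip stays on the ellipse. The helper names `field` and `classify` and the sample amplitudes are assumptions made for this example only.

```python
import math

def field(e_y0, e_z0, phase):
    """Components (73.24); phase stands for omega*t - k*x + alpha."""
    return e_y0 * math.cos(phase), e_z0 * math.sin(phase)

def classify(e_y0, e_z0):
    """Name the polarization state from the two real amplitudes."""
    if e_y0 == 0 or e_z0 == 0:
        return "linear"
    if abs(e_y0) == abs(e_z0):
        return "circular"
    return "elliptical"

e_y0, e_z0 = 3.0, -1.5          # opposite signs reverse the sense of rotation
residuals = []
for i in range(12):
    ey, ez = field(e_y0, e_z0, 2.0 * math.pi * i / 12)
    # Left-hand side of (73.25) minus 1; should be zero at every phase.
    residuals.append((ey / e_y0) ** 2 + (ez / e_z0) ** 2 - 1.0)
```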


74. A Plane Monochromatic Wave in a Conducting Medium 

Equations (71.7) and (71.8) were obtained assuming that the
medium contains no surplus free charges (ρ = 0) and no conduction
currents (j = 0). We established in the preceding section that for
a plane monochromatic wave the function (73.9) with a constant
amplitude is a solution of Eq. (71.7). The failure of the amplitude
to depend on the coordinates signifies that the propagation of a plane
wave in a dielectric is not attended by a change in its intensity.

Now let us assume that a medium has the conductivity σ so that
conduction currents j equal to σE can be produced in it. We shall
consider as previously that surplus free charges are absent (ρ = 0).
With these assumptions, Maxwell's equations (55.10) and (55.11)
will be written as follows:

$$[\nabla \mathbf{E}] = -\frac{1}{c}\frac{\partial \mathbf{B}}{\partial t}, \qquad \nabla \mathbf{B} = 0 \tag{74.1}$$

$$[\nabla \mathbf{B}] = \frac{4\pi}{c}\mu\sigma\mathbf{E} + \frac{\varepsilon\mu}{c}\frac{\partial \mathbf{E}}{\partial t}, \qquad \nabla \mathbf{E} = 0 \tag{74.2}$$

(we have substituted σE for j).

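Let us take the curl of the first of Eqs. (74.1):

$$[\nabla, [\nabla \mathbf{E}]] = -\frac{1}{c}\frac{\partial}{\partial t}[\nabla \mathbf{B}] \tag{74.3}$$

According to (XI.45), $[\nabla, [\nabla \mathbf{E}]] = \nabla(\nabla \mathbf{E}) - \Delta \mathbf{E}$. But $\nabla \mathbf{E} = 0$ so
that only the second term equal to $-\Delta \mathbf{E}$ remains. Let us substitute
it for the left-hand side of Eq. (74.3). In addition, let us introduce
into the right-hand side of this equation the value of $[\nabla \mathbf{B}]$ from
(74.2). The result is

$$-\Delta \mathbf{E} = -\frac{4\pi\mu\sigma}{c^2}\frac{\partial \mathbf{E}}{\partial t} - \frac{\varepsilon\mu}{c^2}\frac{\partial^2 \mathbf{E}}{\partial t^2}$$

Let us write this equation as follows:

$$\Delta \mathbf{E} - \frac{\varepsilon\mu}{c^2}\frac{\partial^2 \mathbf{E}}{\partial t^2} - \frac{4\pi\mu\sigma}{c^2}\frac{\partial \mathbf{E}}{\partial t} = 0 \tag{74.4}$$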


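Equation (74.4) is called a generalized wave equation. It differs
from Eq. (71.9) in an additional term containing the first time
derivative of the required function. When σ = 0, this term vanishes,
and Eq. (74.4) transforms into (71.9).

Considering the wave to be monochromatic, we shall seek the
solution of the equation in the form

$$\mathbf{E} = \operatorname{Re}\{\hat{\mathbf{E}}(\mathbf{r})\, e^{-i\omega t}\} = \operatorname{Re}\{\hat{\mathbf{E}}\} \tag{74.5}$$

where $\hat{\mathbf{E}}(\mathbf{r})$ is a complex vector function of r, and $\hat{\mathbf{E}}$ is the function
in braces. Differentiation of the function (74.5) with respect to
t yields

$$\frac{\partial \mathbf{E}}{\partial t} = \operatorname{Re}\{-i\omega\hat{\mathbf{E}}(\mathbf{r})\, e^{-i\omega t}\} = \operatorname{Re}\{-i\omega\hat{\mathbf{E}}\}$$

$$\frac{\partial^2 \mathbf{E}}{\partial t^2} = \operatorname{Re}\{-\omega^2\hat{\mathbf{E}}(\mathbf{r})\, e^{-i\omega t}\} = \operatorname{Re}\{-\omega^2\hat{\mathbf{E}}\} \tag{74.6}$$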

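Substitution into Eq. (74.4) of the values (74.5) and (74.6), after
cancelling the common factor $e^{-i\omega t}$, results in the following
differential equation for $\hat{\mathbf{E}}(\mathbf{r})$:

$$\Delta\hat{\mathbf{E}}(\mathbf{r}) + \frac{\varepsilon\mu\omega^2}{c^2}\hat{\mathbf{E}}(\mathbf{r}) + i\frac{4\pi\mu\sigma\omega}{c^2}\hat{\mathbf{E}}(\mathbf{r}) = 0 \tag{74.7}$$

Let us multiply the numerator and denominator in the third term
by εω and substitute $k^2$ for $\varepsilon\mu\omega^2/c^2$ [see (73.3)]. As a result,
combining the second and third terms, we get the equation

$$\Delta\hat{\mathbf{E}}(\mathbf{r}) + k^2\left(1 + i\frac{4\pi\sigma}{\varepsilon\omega}\right)\hat{\mathbf{E}}(\mathbf{r}) = 0 \tag{74.8}$$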

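When σ = 0, this equation transforms into an equation like (73.5).

Let us represent the coefficient of $\hat{\mathbf{E}}(\mathbf{r})$ in Eq. (74.8) as the square
of the complex wave number $\hat{k} = k_1 + ik_2$:

$$k^2\left(1 + i\frac{4\pi\sigma}{\varepsilon\omega}\right) = (k_1 + ik_2)^2 = k_1^2 - k_2^2 + 2ik_1k_2$$

Equating the real and imaginary parts, we arrive at a system of two
equations for $k_1$ and $k_2$:

$$k_1^2 - k_2^2 = k^2, \qquad 2k_1k_2 = \frac{4\pi\sigma k^2}{\varepsilon\omega} \tag{74.9}$$

The solution of this system has the form

$$k_1 = k\sqrt{\frac{1 + \sqrt{1 + (4\pi\sigma/\varepsilon\omega)^2}}{2}}, \qquad k_2 = k\sqrt{\frac{-1 + \sqrt{1 + (4\pi\sigma/\varepsilon\omega)^2}}{2}} \tag{74.10}$$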


When $(4\pi\sigma/\varepsilon\omega)^2 \ll 1$ (i.e. when σ is small and ω is large),
formulas (74.10) become greatly simplified: the quantity $k_1$ can be
taken equal to k, and in the expression for $k_2$ we can assume that
$\sqrt{1 + (4\pi\sigma/\varepsilon\omega)^2} \approx 1 + \tfrac{1}{2}(4\pi\sigma/\varepsilon\omega)^2$, which yields the value $2\pi\sigma k/\varepsilon\omega$
for $k_2$. Hence, at a low conductivity and a high frequency

$$\hat{k} = k + i\frac{2\pi\sigma k}{\varepsilon\omega} \tag{74.11}$$

Equation (74.7) can be written as

$$\Delta\hat{\mathbf{E}}(\mathbf{r}) + \hat{k}^2\hat{\mathbf{E}}(\mathbf{r}) = 0 \tag{74.12}$$

where $\hat{k} = k_1 + ik_2$ is a complex quantity whose real and imaginary
parts are determined by formulas (74.10).
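
Formulas (74.10) and the limiting form (74.11) are easy to check numerically. The following is a minimal sketch with hypothetical numbers; the function name `wave_numbers` and the variable `x`, which stands for the dimensionless ratio 4πσ/εω, are assumptions made for the example.

```python
import math

def wave_numbers(k, x):
    """k1, k2 from (74.10); x stands for the ratio 4*pi*sigma/(eps*omega)."""
    root = math.sqrt(1.0 + x * x)
    k1 = k * math.sqrt((1.0 + root) / 2.0)
    k2 = k * math.sqrt((-1.0 + root) / 2.0)
    return k1, k2

k, x = 2.0, 0.75                       # illustrative values only
k1, k2 = wave_numbers(k, x)
lhs = complex(k1, k2) ** 2             # square of the complex wave number
rhs = k * k * complex(1.0, x)          # coefficient of E(r) in (74.8)

# Weak-damping limit: k1 -> k and k2 -> k*x/2, which is (74.11).
k1_small, k2_small = wave_numbers(k, 1e-3)
k2_approx = k * 1e-3 / 2.0
```

Squaring `k1 + i*k2` reproduces the coefficient in (74.8) to rounding error, and for a small ratio the imaginary part approaches kx/2 = 2πσk/εω.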

Consider a plane wave. We choose the x-axis in the direction of
wave propagation. Therefore $\hat{\mathbf{E}}(\mathbf{r})$ depends only on x, and Eq. (74.12)
can be simplified as follows:

$$\frac{d^2\hat{\mathbf{E}}(\mathbf{r})}{dx^2} + \hat{k}^2\hat{\mathbf{E}}(\mathbf{r}) = 0$$

The following function is the solution of this equation:

$$\hat{\mathbf{E}}(\mathbf{r}) = \hat{\mathbf{E}}_0\, e^{\pm i\hat{k}x} \tag{74.13}$$

where $\hat{\mathbf{E}}_0$ is a constant complex vector. The plus and minus signs
correspond to different directions of wave propagation. We shall
be interested in waves running in the direction of the x-axis. We
shall therefore take the plus sign in the exponent (we wrote the
factor depending on t as $e^{-i\omega t}$).

Substituting the solution (74.13) which we have found into (74.5),
we obtain the following expression for E:

$$\mathbf{E} = \operatorname{Re}\{\hat{\mathbf{E}}_0\, e^{i\hat{k}x} e^{-i\omega t}\}$$

Let us represent the complex vector $\hat{\mathbf{E}}_0$ in this formula as $\mathbf{E}_0 e^{-i\alpha}$,
where $\mathbf{E}_0$ is a real constant vector. In addition, let us express the
complex number $\hat{k}$ through its real and imaginary parts ($\hat{k} = k_1 + ik_2$).
The result is

$$\mathbf{E} = \operatorname{Re}\{\mathbf{E}_0\, e^{-i\alpha} e^{i(k_1 + ik_2)x} e^{-i\omega t}\}$$

Let us write this expression as follows:

$$\mathbf{E} = \operatorname{Re}\{\mathbf{E}_0\, e^{-k_2x} e^{-i(\omega t - k_1x + \alpha)}\} \tag{74.14}$$

The real part of the expression in braces yields

$$\mathbf{E} = \mathbf{E}_0\, e^{-k_2x}\cos(\omega t - k_1x + \alpha) \tag{74.15}$$

Expression (74.15) describes a wave whose amplitude diminishes
according to the law $e^{-k_2x}$. The phase velocity of this wave is

$$v = \frac{\omega}{k_1} \tag{74.16}$$

Hence, the real part of the complex wave number $\hat{k}$ determines
the phase velocity of the wave, and the imaginary part determines
the attenuation of the wave.

According to (74.10), with slight attenuation $k_1 \approx k = (\omega/c)\sqrt{\varepsilon\mu}$,
and the attenuation factor is

$$k_2 = \frac{2\pi\sigma k}{\varepsilon\omega} = \frac{2\pi\sigma}{c}\sqrt{\frac{\mu}{\varepsilon}} \tag{74.17}$$


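Let us obtain a solution of Eq. (74.7) in another way. We shall
write this equation as

$$\Delta\hat{\mathbf{E}}(\mathbf{r}) + \left(\varepsilon + i\frac{4\pi\sigma}{\omega}\right)\frac{\mu\omega^2}{c^2}\hat{\mathbf{E}}(\mathbf{r}) = 0$$

If we introduce the complex permittivity

$$\hat{\varepsilon} = \varepsilon + i\frac{4\pi\sigma}{\omega} \tag{74.18}$$

we can write

$$\Delta\hat{\mathbf{E}}(\mathbf{r}) + \frac{\hat{\varepsilon}\mu}{c^2}\omega^2\hat{\mathbf{E}}(\mathbf{r}) = 0$$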


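Let us multiply this equation by $e^{-i\omega t}$ and take into account that
$\omega^2\hat{\mathbf{E}}(\mathbf{r})\, e^{-i\omega t} = -\partial^2\hat{\mathbf{E}}/\partial t^2$, where $\hat{\mathbf{E}} = \hat{\mathbf{E}}(\mathbf{r})\, e^{-i\omega t}$ [see (74.6)]. As
a result, we arrive at the equation

$$\Delta\hat{\mathbf{E}} - \frac{\hat{\varepsilon}\mu}{c^2}\frac{\partial^2\hat{\mathbf{E}}}{\partial t^2} = 0 \tag{74.19}$$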


differing from Eq. (71.9) only in that the electric field strength and
the permittivity are not real, but complex. The field E is the real
part of the solution of Eq. (74.19).

We found the solution of Eq. (71.9) in Sec. 73. It is

$$\mathbf{E} = \operatorname{Re}\{\mathbf{E}_0\, e^{-i(\omega t - kx + \alpha)}\} = \operatorname{Re}\{\hat{\mathbf{E}}\}$$

where $k = \sqrt{\varepsilon\mu}\,\omega/c$ (the wave is assumed to run in the direction of
the x-axis). Consequently, when ε is real, the function

$$\hat{\mathbf{E}} = \mathbf{E}_0\, e^{-i(\omega t - kx + \alpha)} \tag{74.20}$$

is the solution of Eq. (74.19).

Substitution of the complex value of the permittivity for the
real one, ε, does not change the form of the solution. But the real
wave number k must be replaced in it with the complex number
determined by the formula

$$\hat{k} = \sqrt{\hat{\varepsilon}\mu}\,\frac{\omega}{c} \tag{74.21}$$

where $\hat{\varepsilon}$ is the quantity from (74.18). Substitution into formula
(74.20) of the complex number $\hat{k} = k_1 + ik_2$ for k leads to an
expression coinciding with (74.15).

With a view to (74.18), the square of the complex number (74.21) is

$$\hat{k}^2 = \frac{\hat{\varepsilon}\mu\omega^2}{c^2} = \left(\varepsilon + i\frac{4\pi\sigma}{\omega}\right)\frac{\mu\omega^2}{c^2} = \frac{\varepsilon\mu\omega^2}{c^2} + i\frac{4\pi\sigma}{\varepsilon\omega}\frac{\varepsilon\mu\omega^2}{c^2} = k^2\left(1 + i\frac{4\pi\sigma}{\varepsilon\omega}\right)$$

This value coincides with the coefficient of $\hat{\mathbf{E}}(\mathbf{r})$ in formula (74.8).

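Hence, with a conducting medium, both the wave number and
the permittivity become complex. The refractive index $\hat{n} = \sqrt{\hat{\varepsilon}\mu}$
is also complex. Let us write it as

$$\hat{n} = n + i\varkappa = \sqrt{\hat{\varepsilon}\mu} = \sqrt{\left(\varepsilon + i\frac{4\pi\sigma}{\omega}\right)\mu} \tag{74.22}$$

Let us square this relation:

$$(n + i\varkappa)^2 = n^2 - \varkappa^2 + 2in\varkappa = \left(\varepsilon + i\frac{4\pi\sigma}{\omega}\right)\mu = \varepsilon\mu + i\frac{4\pi\sigma\mu}{\omega}$$

We equate the real and imaginary parts:

$$n^2 - \varkappa^2 = \varepsilon\mu, \qquad 2n\varkappa = \frac{4\pi\sigma\,\varepsilon\mu}{\varepsilon\omega}$$

The system of equations obtained is identical with the system
(74.9). Therefore, its solutions can be found by substituting εμ
for $k^2$ in formulas (74.10). The result is

$$n = \sqrt{\varepsilon\mu}\sqrt{\frac{1 + \sqrt{1 + (4\pi\sigma/\varepsilon\omega)^2}}{2}}, \qquad \varkappa = \sqrt{\varepsilon\mu}\sqrt{\frac{-1 + \sqrt{1 + (4\pi\sigma/\varepsilon\omega)^2}}{2}} \tag{74.23}$$

A comparison of formulas (74.10) and (74.23) shows that

$$k_1 = \frac{\omega}{c}n, \qquad k_2 = \frac{\omega}{c}\varkappa \tag{74.24}$$

(we have determined k as $\omega\sqrt{\varepsilon\mu}/c$).

Substituting for $k_1$ its value from (74.24) into formula (74.16),
we arrive at the relation

$$v = \frac{c}{n}$$

from which it follows that the real part of the complex refractive
index $\hat{n}$ is the conventional refractive index n of the medium. The
second of formulas (74.24) shows that the imaginary part of $\hat{n}$ is
proportional to the attenuation factor of the wave ($\varkappa \propto k_2$).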

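Up to now we dealt with the electric field of a wave. The magnetic
field can be obtained from relation (73.21) by introducing the
complex value of the permittivity and the complex function $\hat{\mathbf{E}}$ into it.
By (74.14), we have

$$\hat{\mathbf{E}} = \mathbf{E}_0\, e^{-k_2x} e^{-i(\omega t - k_1x + \alpha)}$$

Consequently,

$$\hat{\mathbf{B}} = \sqrt{\hat{\varepsilon}\mu}\,[\mathbf{k}_0, \mathbf{E}_0\, e^{-k_2x} e^{-i(\omega t - k_1x + \alpha)}]$$

The complex number $\sqrt{\hat{\varepsilon}\mu}$ can be written as

$$\sqrt{\hat{\varepsilon}\mu} = n + i\varkappa = \sqrt{n^2 + \varkappa^2}\, e^{i\psi}$$

where $\psi = \tan^{-1}(\varkappa/n)$. Taking advantage of this, we can write
the formula

$$\hat{\mathbf{B}} = \sqrt{n^2 + \varkappa^2}\,[\mathbf{k}_0, \mathbf{E}_0\, e^{-k_2x} e^{-i(\omega t - k_1x + \alpha - \psi)}]$$

from which it can be seen that the vectors B and E oscillate not
in the same phase (as in a dielectric), but with the phase difference ψ
determined by the expression

$$\tan\psi = \frac{\varkappa}{n} \tag{74.25}$$

The relation between the amplitudes of the electric and magnetic
fields is determined by the formula

$$B_0 = \sqrt{n^2 + \varkappa^2}\, E_0$$

which after introduction of the values (74.23) for n and ϰ becomes

$$B_0 = E_0\sqrt{\varepsilon\mu}\,\sqrt[4]{1 + \left(\frac{4\pi\sigma}{\varepsilon\omega}\right)^2} \tag{74.26}$$

[compare with (73.21)].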
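
The relations between the complex permittivity, the complex refractive index, the phase shift and the amplitude ratio can be cross-checked with complex arithmetic. The sketch below assumes Gaussian units as in the text; the function name `complex_index` and all numerical values are hypothetical and chosen only for illustration.

```python
import cmath
import math

def complex_index(eps, mu, sigma, omega):
    """Return (n, kappa) with n + i*kappa = sqrt(eps_hat * mu), Gaussian units."""
    eps_hat = complex(eps, 4.0 * math.pi * sigma / omega)    # (74.18)
    n_hat = cmath.sqrt(eps_hat * mu)                          # (74.22), principal root
    return n_hat.real, n_hat.imag

eps, mu, sigma, omega = 4.0, 1.0, 0.3, 2.0                    # illustrative numbers only
n, kappa = complex_index(eps, mu, sigma, omega)

x = 4.0 * math.pi * sigma / (eps * omega)
root = math.sqrt(1.0 + x * x)
n_formula = math.sqrt(eps * mu) * math.sqrt((1.0 + root) / 2.0)        # (74.23)
kappa_formula = math.sqrt(eps * mu) * math.sqrt((-1.0 + root) / 2.0)   # (74.23)

psi = math.atan2(kappa, n)                                    # phase lag of B, (74.25)
ratio = math.hypot(n, kappa)                                  # B0 / E0
ratio_formula = math.sqrt(eps * mu) * (1.0 + x * x) ** 0.25   # (74.26)
```

Taking the principal square root gives positive n and ϰ, which is the branch that corresponds to a wave damped in its direction of propagation.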

75. Non-Monochromatic Waves¹

Any non-monochromatic wave can be represented as the superpo- 
sition of monochromatic waves of different frequencies. This opera- 
tion is known as the spectral decomposition of the wave. 

If the field of a wave is described by a strictly periodic function,
it can be expanded into a Fourier series. In this case, the spectral
expansion contains frequencies forming a discrete series of values:
$\omega_1, \omega_2, \omega_3, \ldots$. The frequencies $\omega_n$ ($n \neq 0$) are integral multiples
of the fundamental frequency $\omega_0$, i.e. $\omega_n = n\omega_0$. The fundamental
frequency is determined by the period T of the function describing
the field: $\omega_0 = 2\pi/T$.

¹ Before beginning to read this section, acquaint yourself with Appendix XIV.

Consider a plane non-monochromatic wave propagating in a
vacuum in the positive direction of the x-axis. Any quantity
characterizing such a wave (E, B, etc.) is described by the function

$$f = f\left(t - \frac{x}{c}\right) \tag{75.1}$$

[see (72.8)]. Upon fixing x, we obtain a function of t describing
the oscillations of the field at a given point. We have presumed that
this function f is strictly periodic. Assume that its period is T:

$$f(t + T) = f(t)$$

According to (XIV.11), we can write f(t) as

$$f(t) = \sum_{n=-\infty}^{+\infty} C_n e^{-in\omega_0 t} \tag{75.2}$$

where $C_n$ is a constant calculated by the formula

$$C_n = \frac{1}{T}\int_{-T/2}^{+T/2} f(t)\, e^{in\omega_0 t}\, dt$$

[see (XIV.14)].

According to formula (XIV.15), the expansion of the function
f(t) can also be written as

$$f(t) = \sum_{n=-\infty}^{+\infty} C_n^* e^{in\omega_0 t} \tag{75.3}$$

The average intensity of a wave is proportional to the average
value of the square of E or B, i.e. is proportional to the expression

$$\langle [f(t)]^2 \rangle = \frac{1}{T}\int_{-T/2}^{+T/2}\left(\sum_{n=-\infty}^{+\infty} C_n e^{-in\omega_0 t}\right)\left(\sum_{m=-\infty}^{+\infty} C_m^* e^{im\omega_0 t}\right) dt \tag{75.4}$$

The integrand equals $[f(t)]^2$. We have taken one of the factors
in the form of (75.2), and the other in the form of (75.3), and in
the second sum denoted the dummy index by the letter m instead of n.
Let us transform expression (75.4) as follows:

$$\langle [f(t)]^2 \rangle = \sum_{n,m=-\infty}^{+\infty} C_n C_m^* \frac{1}{T}\int_{-T/2}^{+T/2} e^{i(m-n)\omega_0 t}\, dt = \sum_{n,m=-\infty}^{+\infty} C_n C_m^* \delta_{nm} = \sum_{n=-\infty}^{+\infty} C_n C_n^* = \sum_{n=-\infty}^{+\infty} |C_n|^2 = C_0^2 + 2\sum_{n=1}^{\infty} |C_n|^2 \tag{75.5}$$

[we have taken advantage of the orthogonal nature of the system
of functions $e^{in\omega_0 t}$; see (XIV.13)].

The result we have obtained signifies that the average intensity 
of a non-monochromatic wave is composed of the intensities of the 
monochromatic components. 
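
Relation (75.5) can be checked on a simple periodic signal. The sketch below is illustrative only: the test function, the period, and the sampling density are assumptions, and the integrals are approximated by a rectangle rule over one period (which is very accurate for a trigonometric polynomial).

```python
import cmath
import math

T = 2.0                                  # period (arbitrary choice)
omega0 = 2.0 * math.pi / T               # fundamental frequency
N = 2000                                 # samples per period

def f(t):
    """Hypothetical periodic signal with harmonics n = 0, 1, 3."""
    return 1.0 + 2.0 * math.cos(omega0 * t) + 0.5 * math.sin(3.0 * omega0 * t)

ts = [-T / 2 + T * j / N for j in range(N)]

def coefficient(n):
    """C_n = (1/T) * integral of f(t) exp(i n omega0 t) dt over one period."""
    return sum(f(t) * cmath.exp(1j * n * omega0 * t) for t in ts) / N

mean_square = sum(f(t) ** 2 for t in ts) / N                 # left side of (75.5)
coeffs = {n: coefficient(n) for n in range(-5, 6)}
sum_all = sum(abs(c) ** 2 for c in coeffs.values())          # sum over all n
sum_half = abs(coeffs[0]) ** 2 + 2.0 * sum(abs(coeffs[n]) ** 2 for n in range(1, 6))
```

For this signal the mean square is 1 + 2 + 1/8, and both forms of the sum in (75.5) reproduce it.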

Now let us consider the case when a wave and, consequently,
the oscillation f(t) too exists during a limited time interval (and
not from t = −∞ to t = +∞). In this case, f(t) is not periodic
and can be expanded not into a series, but into a Fourier integral
containing a continuous series of different frequencies. By (XIV.22)

$$f(t) = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{+\infty} C_\omega e^{-i\omega t}\, d\omega \tag{75.6}$$

where $C_\omega$ is a function of the frequency ω determined by the
expression¹

$$C_\omega = \frac{1}{\sqrt{2\pi}}\int_{-\infty}^{+\infty} f(\xi)\, e^{i\omega\xi}\, d\xi \tag{75.7}$$

[see (XIV.21)]. It is evident that

$$C_{-\omega} = C_\omega^* \tag{75.8}$$

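The total intensity of a wave during the time from −∞ to +∞
is determined by the expression

$$I = \int_{-\infty}^{+\infty} [f(t)]^2\, dt = \int_{-\infty}^{+\infty} f(t) f(t)\, dt$$

Substitution of expression (75.6) for one of the factors yields

$$I = \int_{-\infty}^{+\infty} f(t)\left\{\frac{1}{\sqrt{2\pi}}\int_{-\infty}^{+\infty} C_\omega e^{-i\omega t}\, d\omega\right\} dt$$

In the formula we have obtained, it is assumed that integration is
first performed over ω, and then over t. Let us change the sequence
of integration, i.e. write

$$I = \int_{-\infty}^{+\infty} C_\omega\left\{\frac{1}{\sqrt{2\pi}}\int_{-\infty}^{+\infty} f(t)\, e^{-i\omega t}\, dt\right\} d\omega$$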

¹ The integration variable in formula (75.7) is customarily designated by
the letter t. We have used a different letter for it to stress that $C_\omega$ is a quantity
not depending on t.

A comparison with (75.7) shows that the integral inside the braces
is $C_{-\omega}$, i.e. $C_\omega^*$. Consequently,

$$I = \int_{-\infty}^{+\infty} C_\omega C_\omega^*\, d\omega = \int_{-\infty}^{+\infty} |C_\omega|^2\, d\omega = 2\int_0^{\infty} |C_\omega|^2\, d\omega \tag{75.9}$$

It follows from the formula we have obtained that the quantity
$|C_\omega|^2$ characterizes the fraction of the total intensity per unit
interval of frequencies.
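
Relation (75.9) can be illustrated on a pulse of limited duration. The sketch below uses a Gaussian pulse as a hypothetical test signal and a plain trapezoid rule with truncated limits; the helper names, the truncation at ±10 and the step counts are assumptions of the example.

```python
import cmath
import math

def f(t):
    """Test pulse; it is negligible beyond |t| of about 8."""
    return math.exp(-t * t / 2.0)

def integrate(func, lo, hi, steps):
    """Composite trapezoid rule (works for real or complex integrands)."""
    h = (hi - lo) / steps
    total = 0.5 * (func(lo) + func(hi))
    for j in range(1, steps):
        total += func(lo + j * h)
    return total * h

def spectrum(omega):
    """C_omega from (75.7), with the integral truncated to [-10, 10]."""
    value = integrate(lambda s: f(s) * cmath.exp(1j * omega * s), -10.0, 10.0, 400)
    return value / math.sqrt(2.0 * math.pi)

total_time = integrate(lambda t: f(t) ** 2, -10.0, 10.0, 400)            # integral of f^2 dt
total_freq = 2.0 * integrate(lambda w: abs(spectrum(w)) ** 2, 0.0, 10.0, 200)
```

For this pulse both integrals equal the square root of π, so the time-domain and frequency-domain totals agree.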

The radiation of a charge experiencing deceleration can be men- 
tioned as an example of a field that can be expanded into a Fourier 
integral. 




Chapter XIV 


RADIATION 

OF ELECTROMAGNETIC WAVES 


76. Retarded Potentials 

We assumed in the preceding chapter that charges and currents
are absent, and in this connection we presumed that the right-hand
sides in d'Alembert's equations (71.2) and (71.3) equal zero. Now let
us turn to studying time-varying fields in the presence of arbitrarily
moving charges. In this case, the potentials of the field satisfy the
equations

$$\Delta\varphi - \frac{1}{c^2}\frac{\partial^2\varphi}{\partial t^2} = -4\pi\rho \tag{76.1}$$

$$\Delta\mathbf{A} - \frac{1}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} = -\frac{4\pi}{c}\mathbf{j} \tag{76.2}$$

It is known from the theory of linear differential equations that 
the general solution of a non-homogeneous equation equals the 
sum of the general solution of a homogeneous equation and a partic- 
ular solution of a non-homogeneous one. General solutions of 
homogeneous equations were studied in the preceding chapter. 
Consequently, to obtain the general solutions of Eqs. (76.1) and
(76.2), it is sufficient to find their particular solutions.

Let us divide the entire space containing charges and currents
into elementary volumes dV' and determine the field set up by
each of the charges de contained in a given dV'. Owing to the linear
nature of the equations, the required field will be the superposition
of the fields set up by all the charges de.

The charge de contained in a given elementary volume dV' is,
generally speaking, a function of time: de = de(t). If we pay no
attention to the presence of other charges de, the charge density
due to the point charge being considered can be written as

$$\rho'(\mathbf{r}, t) = de(t)\,\delta(\mathbf{r} - \mathbf{r}') \tag{76.3}$$

where r' is the position vector determining the position of the charge
de(t). We have designated this density by the symbol ρ' to
distinguish it from the density ρ(r, t) determining de(t) by the formula
de(t) = ρ(r', t) dV'.

Substitution of expression (76.3) into Eq. (76.1) yields

$$\Delta\varphi - \frac{1}{c^2}\frac{\partial^2\varphi}{\partial t^2} = -4\pi\, de(t)\,\delta(\mathbf{R}) \tag{76.4}$$

where R = r − r'.
At all points except the one for which R = 0, the density ρ' = 0,
and Eq. (76.4) has the form

$$\Delta\varphi - \frac{1}{c^2}\frac{\partial^2\varphi}{\partial t^2} = 0 \tag{76.5}$$

It is obvious that in this case the field has central symmetry relative
to the point R = 0 and, consequently, is a function only of R.
Let us therefore write Eq. (76.5) in spherical coordinates. Using
expression (XI.88) for the Laplacian, we obtain

$$\frac{1}{R^2}\frac{\partial}{\partial R}\left(R^2\frac{\partial\varphi}{\partial R}\right) - \frac{1}{c^2}\frac{\partial^2\varphi}{\partial t^2} = 0 \tag{76.6}$$

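We shall seek the solution in the form

$$\varphi = \frac{\psi(R, t)}{R} \tag{76.7}$$

When this condition is observed, we have

$$\frac{\partial\varphi}{\partial R} = -\frac{\psi}{R^2} + \frac{1}{R}\frac{\partial\psi}{\partial R}$$

$$\frac{\partial}{\partial R}\left(R^2\frac{\partial\varphi}{\partial R}\right) = \frac{\partial}{\partial R}\left(-\psi + R\frac{\partial\psi}{\partial R}\right) = -\frac{\partial\psi}{\partial R} + \frac{\partial\psi}{\partial R} + R\frac{\partial^2\psi}{\partial R^2} = R\frac{\partial^2\psi}{\partial R^2}$$

$$\frac{\partial^2\varphi}{\partial t^2} = \frac{\partial^2}{\partial t^2}\left(\frac{\psi}{R}\right) = \frac{1}{R}\frac{\partial^2\psi}{\partial t^2}$$

The introduction of these values into Eq. (76.6) and the cancelling
of 1/R yield the equation¹

$$\frac{\partial^2\psi}{\partial R^2} - \frac{1}{c^2}\frac{\partial^2\psi}{\partial t^2} = 0$$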

As we established in Sec. 72, the general solution of such an
equation has the form

$$\psi = f_1\left(t - \frac{R}{c}\right) + f_2\left(t + \frac{R}{c}\right) \tag{76.8}$$

[see formula (72.8); in the case we are now considering, n = 1].
The first term in (76.8) describes an expanding spherical wave,
and the second term describes a spherical wave converging at the
point R = 0. We are interested in a particular solution. Let us
take the first term of formula (76.8) as such a solution. Introducing
it into (76.7), we find an expression for φ:

$$\varphi(\mathbf{r}, t) = \frac{f(t - R/c)}{R} \tag{76.9}$$
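
That a function of the form f(t − R/c)/R satisfies the spherically symmetric wave equation away from R = 0 can be verified numerically with finite differences. In the sketch below the profile f, the value c = 1, the step h and the sample point are arbitrary assumptions made for the check.

```python
import math

c = 1.0                                    # units with c = 1 (assumption)

def f(s):
    """Arbitrary smooth profile; any twice-differentiable function would do."""
    return math.exp(-s * s)

def phi(R, t):
    """Outgoing spherical wave f(t - R/c) / R."""
    return f(t - R / c) / R

def residual(R, t, h=1e-3):
    """Finite-difference value of (1/R^2) d/dR (R^2 dphi/dR) - (1/c^2) d2phi/dt2."""
    def flux(r):                           # R^2 * dphi/dR by a central difference
        return r * r * (phi(r + h, t) - phi(r - h, t)) / (2.0 * h)
    radial = (flux(R + h) - flux(R - h)) / (2.0 * h) / (R * R)
    second_t = (phi(R, t + h) - 2.0 * phi(R, t) + phi(R, t - h)) / (h * h)
    return radial - second_t / (c * c)

res = residual(2.0, 1.5)                   # should be close to zero
scale = abs(phi(2.0, 1.5))                 # size of phi itself, for comparison
```

The residual is small compared with the individual terms, which are of the same order as φ itself at this point; what remains is the discretization error of the differences.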


¹ Remember that the point for which R = 0 has meanwhile been omitted
from consideration.

The solution (76.9) satisfies Eq. (76.5) with an arbitrary choice
of the function f(t − R/c). Let us try to choose this function so that
expression (76.9) will also satisfy Eq. (76.4) at the point R = 0.
We must note that at R → 0 the function (76.9) tends to infinity.
Consequently, its derivative with respect to the coordinates
becomes very large for small R's, so that the term $(1/c^2)(\partial^2\varphi/\partial t^2)$ may
be disregarded in comparison with Δφ and Eq. (76.4) can be written
as

$$\Delta\varphi = -4\pi\, de(t)\,\delta(\mathbf{R})$$

We have arrived at Poisson's equation (42.4) for the potential
of a point charge. Hence, near the point R = 0, the function (76.9)
should transform into an expression of the kind φ = e/R. This
will occur if we assume that f(t − R/c) = de(t − R/c).
Consequently, the required solution is

$$\varphi(\mathbf{r}, t) = \frac{de(t - R/c)}{R} \tag{76.10}$$

It is evident that in the direct proximity of the point R = 0, this
expression becomes

$$\varphi = \frac{de(t)}{R}$$

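To find the potential for the arbitrary distribution of charges
described by the function ρ(r, t), let us sum the solution
(76.10) over all de = ρ dV'. We thus obtain

$$\varphi(\mathbf{r}, t) = \int \frac{\rho(\mathbf{r}', t - |\mathbf{r} - \mathbf{r}'|/c)}{|\mathbf{r} - \mathbf{r}'|}\, dV' \tag{76.11}$$

Equation (76.2) differs from (76.1) only in the right-hand side
containing j/c instead of the function ρ. We can therefore directly
write an expression for A, by analogy with (76.11):

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{c}\int \frac{\mathbf{j}(\mathbf{r}', t - |\mathbf{r} - \mathbf{r}'|/c)}{|\mathbf{r} - \mathbf{r}'|}\, dV' \tag{76.12}$$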


Expressions (76.11) and (76.12) are called retarded potentials.
The name is due to the fact that the values of the potentials at the
instant t are determined by the values of ρ and j at earlier instants
preceding t by the retardation time τ = |r − r'|/c needed for the
electromagnetic disturbance to reach the point r from the point r'.

To obtain the general solution of Eqs. (76.1) and (76.2), it is 
necessary to add to the retarded potentials the general solutions 
of the homogeneous equations that were found in the preceding 
chapter. These solutions are not related to the field produced by 
the system. They describe the external field acting on the system 
and superposed onto the field set up by the system. 






In the stationary case (i.e. when p and j do not change with time), 
formulas (76.11) and (76.12) transform into expressions (41.10) and 
(48.7), respectively. 

We must note that if we had chosen the function $f_2$ as the
particular solution [see (76.8)], we would have obtained advanced
potentials instead of retarded ones. We shall not stop to consider this
in greater detail.



77. Field of a Uniformly Moving Charge 

Assume that the charge e is moving at the constant velocity v 
relative to the reference frame K. Let us associate the reference 
frame K' with the charge. This frame, like the charge, moves rela- 
tive to the frame K at the velocity v. 

The charge is at rest relative to the frame K'. Consequently,
the potentials of the field in this frame are

$$\varphi'(\mathbf{r}', t') = \frac{e}{r'}, \qquad \mathbf{A}'(\mathbf{r}', t') = 0 \tag{77.1}$$

The most general solution would be A' = const, but owing to gauge
invariance, this constant can be taken equal to zero.

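To find the potential in the frame K, let us transform the
four-vector

$$A^\mu = (\varphi, \mathbf{A})$$

to the frame K. For this purpose, we must take the formulas (36.2)
for the inverse transformation, according to which

$$\varphi(\mathbf{r}, t) = \frac{\varphi'(\mathbf{r}', t') + \beta A'_x(\mathbf{r}', t')}{\sqrt{1 - \beta^2}} = \frac{\varphi'(\mathbf{r}', t')}{\sqrt{1 - \beta^2}}$$

$$A_x(\mathbf{r}, t) = \frac{A'_x(\mathbf{r}', t') + \beta\varphi'(\mathbf{r}', t')}{\sqrt{1 - \beta^2}} = \frac{\beta\varphi'(\mathbf{r}', t')}{\sqrt{1 - \beta^2}}$$

$$A_y = A'_y = 0, \qquad A_z = A'_z = 0$$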

Introducing for φ' in these formulas its value from (77.1), we obtain

$$\varphi(\mathbf{r}, t) = \frac{e}{r'\sqrt{1 - \beta^2}}, \qquad A_x(\mathbf{r}, t) = \frac{(v/c)\, e}{r'\sqrt{1 - \beta^2}} \tag{77.2}$$

Since only the component $A_x$ differs from zero, while the vector v
is directed along the x-axis, the expression for the vector potential
can be written as

$$\mathbf{A}(\mathbf{r}, t) = \frac{e\mathbf{v}}{cr'\sqrt{1 - \beta^2}} \tag{77.3}$$
Now in formulas (77.2) and (77.3), we must pass over from the
primed coordinates (i.e. from r') to unprimed ones. According to
the Lorentz transformations (35.14)

$$x' = \frac{x - vt}{\sqrt{1 - \beta^2}}, \qquad y' = y, \qquad z' = z$$

Consequently,

$$r' = \sqrt{x'^2 + y'^2 + z'^2} = \sqrt{\left[\frac{x - vt}{\sqrt{1 - \beta^2}}\right]^2 + y^2 + z^2} = \frac{1}{\sqrt{1 - \beta^2}}\sqrt{(x - vt)^2 + (1 - \beta^2)(y^2 + z^2)} \tag{77.4}$$

Substitution of (77.4) into (77.2) and (77.3) yields

$$\varphi(\mathbf{r}, t) = \frac{e}{\sqrt{(x - vt)^2 + (1 - \beta^2)(y^2 + z^2)}} \tag{77.5}$$

$$\mathbf{A}(\mathbf{r}, t) = \frac{e\mathbf{v}}{c\sqrt{(x - vt)^2 + (1 - \beta^2)(y^2 + z^2)}} \tag{77.6}$$

A comparison of (77.5) and (77.6) shows that the following
relation exists between the potentials:

$$\mathbf{A} = \frac{\varphi\mathbf{v}}{c} \tag{77.7}$$

Formulas (77.5) and (77.6) can be simplified by expressing them
in terms of the length of the vector R drawn from the charge to the
point of observation, and of the angle ϑ between the direction of
this vector and the x-axis. If we begin to measure the time from
the instant when the charge was at the origin of coordinates, we can
see from Fig. 77.1 that

$$R^2 = (x - vt)^2 + y^2 + z^2$$

Therefore, by (77.4), we have

$$r' = \frac{R}{\sqrt{1 - \beta^2}}\sqrt{1 - \beta^2\,\frac{y^2 + z^2}{R^2}}$$

The ratio $(y^2 + z^2)/R^2$ equals $\sin^2\vartheta$ (see Fig. 77.1). Consequently,

$$r' = \frac{R}{\sqrt{1 - \beta^2}}\sqrt{1 - \beta^2\sin^2\vartheta}$$

Introducing this value into (77.2) and (77.3), we obtain

$$\varphi = \frac{e}{R\sqrt{1 - \beta^2\sin^2\vartheta}} \tag{77.8}$$

$$\mathbf{A} = \frac{e\mathbf{v}}{cR\sqrt{1 - \beta^2\sin^2\vartheta}} \tag{77.9}$$

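Knowing the potentials, we can use the formulas

$$\mathbf{E} = -\nabla\varphi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} \tag{77.10}$$

$$\mathbf{B} = [\nabla\mathbf{A}] \tag{77.11}$$

[see (56.3) and (56.1)] to calculate the fields E and B. Writing
(77.10) in components, we obtain

$$E_i = -\frac{\partial\varphi}{\partial x_i} - \frac{1}{c}\frac{\partial A_i}{\partial t}$$

Hence,

$$E_x = -\frac{\partial\varphi}{\partial x} - \frac{1}{c}\frac{\partial A_x}{\partial t} = -\frac{\partial\varphi}{\partial x} - \frac{1}{c}\frac{\partial}{\partial t}\left(\frac{v\varphi}{c}\right) = -\frac{\partial\varphi}{\partial x} - \frac{v}{c^2}\frac{\partial\varphi}{\partial t} \tag{77.12}$$

$$E_y = -\frac{\partial\varphi}{\partial y}, \qquad E_z = -\frac{\partial\varphi}{\partial z} \tag{77.13}$$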

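According to (77.5) and (77.6), φ and A have the same form

$$\psi(\mathbf{r}, t) = \frac{a}{\sqrt{(x - vt)^2 + (1 - \beta^2)(y^2 + z^2)}}$$

where a is a constant (scalar or vector). Introducing the notation
x − vt = ξ, we can write that

$$\frac{\partial\psi}{\partial x} = \frac{\partial\psi}{\partial\xi}\frac{\partial\xi}{\partial x} = \frac{\partial\psi}{\partial\xi}, \qquad \frac{\partial\psi}{\partial t} = \frac{\partial\psi}{\partial\xi}\frac{\partial\xi}{\partial t} = -v\frac{\partial\psi}{\partial\xi}$$

whence we conclude that

$$\frac{\partial\psi}{\partial t} = -v\frac{\partial\psi}{\partial x} \tag{77.14}$$

Substituting for ∂φ/∂t in (77.12) its value from (77.14), we arrive
at the formula

$$E_x = -\frac{\partial\varphi}{\partial x} - \frac{v}{c^2}\frac{\partial\varphi}{\partial t} = -\frac{\partial\varphi}{\partial x} + \frac{v^2}{c^2}\frac{\partial\varphi}{\partial x} = -\frac{\partial\varphi}{\partial x}(1 - \beta^2) \tag{77.15}$$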
Introduction of the derivatives with respect to the coordinates
of expression (77.5) into formulas (77.15) and (77.13) yields

$$E_x = (1 - \beta^2)\frac{e(x - vt)}{[(x - vt)^2 + (1 - \beta^2)(y^2 + z^2)]^{3/2}}$$

$$E_y = (1 - \beta^2)\frac{ey}{[(x - vt)^2 + (1 - \beta^2)(y^2 + z^2)]^{3/2}} \tag{77.16}$$

$$E_z = (1 - \beta^2)\frac{ez}{[(x - vt)^2 + (1 - \beta^2)(y^2 + z^2)]^{3/2}}$$

These formulas can be written in the vector form:

$$\mathbf{E} = (1 - \beta^2)\frac{e\mathbf{R}}{R^3(1 - \beta^2\sin^2\vartheta)^{3/2}} \tag{77.17}$$

where R is a position vector drawn from the point where the charge
is to the point of observation (see Fig. 77.1).

For the magnitude of E, we have

$$E = \frac{e(1 - \beta^2)}{R^2(1 - \beta^2\sin^2\vartheta)^{3/2}} \tag{77.18}$$


A glance at this formula shows that on the axis along which the
charge is moving (i.e. at ϑ = 0 and π),

$$E = \frac{e(1 - \beta^2)}{R^2} \tag{77.19}$$

and in directions perpendicular to the velocity of the charge (i.e. at
ϑ = π/2),

$$E = \frac{e}{R^2\sqrt{1 - \beta^2}} \tag{77.20}$$

The field flattens, as it were, in the direction of motion of the charge,
and to a greater extent with increasing v.
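
The flattening can be made quantitative with a short calculation. The sketch below evaluates (77.18) along and across the direction of motion and, as an extra consistency check that is not carried out in the text, integrates the radial field over a sphere centred on the charge to confirm that the total flux is still 4πe. The function name, the numerical values and the midpoint-rule integration are assumptions of the example.

```python
import math

def field_magnitude(e, R, beta, theta):
    """|E| from (77.18) in Gaussian units; theta is the angle between R and v."""
    return e * (1.0 - beta ** 2) / (R ** 2 * (1.0 - beta ** 2 * math.sin(theta) ** 2) ** 1.5)

e, R, beta = 1.0, 2.0, 0.8                                # illustrative values only
along = field_magnitude(e, R, beta, 0.0)                  # (77.19)
across = field_magnitude(e, R, beta, math.pi / 2.0)       # (77.20)

# Flux of E through a sphere of radius R centred on the charge (E is radial).
steps = 20000
h = math.pi / steps
flux = 0.0
for j in range(steps):
    theta = (j + 0.5) * h                                 # midpoint rule in theta
    ring_area = 2.0 * math.pi * R ** 2 * math.sin(theta) * h
    flux += field_magnitude(e, R, beta, theta) * ring_area
```

At β = 0.8 the transverse field exceeds the longitudinal one by the factor (1 − β²)^(−3/2) ≈ 4.6, while the integrated flux is unchanged.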

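Let us find the magnetic field. By (77.11) and (77.7)

$$\mathbf{B} = [\nabla\mathbf{A}] = \left[\nabla, \frac{\varphi\mathbf{v}}{c}\right] = \frac{1}{c}[\nabla\varphi, \mathbf{v}] = -\frac{1}{c}[\mathbf{v}, \nabla\varphi] \tag{77.21}$$

(recall that v is constant).
According to (77.10), we have

$$-\nabla\varphi = \mathbf{E} + \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} = \mathbf{E} + \frac{1}{c^2}\frac{\partial(\varphi\mathbf{v})}{\partial t} = \mathbf{E} + \frac{\mathbf{v}}{c^2}\frac{\partial\varphi}{\partial t} \tag{77.22}$$

[we have taken into account relation (77.7), and also the
circumstance that the velocity is independent of t].

We introduce expression (77.22) for −∇φ into (77.21):

$$\mathbf{B} = \frac{1}{c}[\mathbf{vE}] + \frac{1}{c^3}\frac{\partial\varphi}{\partial t}[\mathbf{vv}]$$

The second term is the vector product of collinear vectors and
therefore vanishes. Hence,

$$\mathbf{B} = \frac{1}{c}[\mathbf{vE}] \tag{77.23}$$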

We see that the vector B at each point is perpendicular both to the
vector v and to the vector E. The vectors v and E are collinear
on the x-axis. Consequently, on this axis, B = 0.

If we introduce expression (77.17) for E into formula (77.23),
we obtain

$$\mathbf{B} = (1 - \beta^2)\frac{e}{c}\frac{[\mathbf{vR}]}{R^3(1 - \beta^2\sin^2\vartheta)^{3/2}} \tag{77.24}$$

When v ≪ c, Eq. (77.24) transforms into the formula

$$\mathbf{B} = \frac{e}{c}\frac{[\mathbf{vR}]}{R^3} \tag{77.25}$$

We must note that expressions (77.17) and (77.25) can be
obtained with the aid of formulas (62.2) for the transformation of fields,
proceeding from the fact that $\mathbf{E}' = e\mathbf{r}'/r'^3$ and B' = 0 in the
frame K'. We invite our reader to do this as an exercise.


78. Field of an Arbitrarily Moving Charge 

Let us find the field of a charge moving with acceleration. Assume
that motion occurs along the trajectory $\mathbf{r}_0 = \mathbf{r}_0(t)$ (Fig. 78.1).
We shall calculate the field at the point of observation P determined
by the position vector r. Owing to retardation, the potentials at the
point P at the instant t are determined by the position and velocity
of the charge not at the same instant t, but at the earlier instant $t_0$.
The latter must satisfy the condition

$$c\tau = c(t - t_0) = R(t_0) \tag{78.1}$$

where $R(t_0)$ is the distance from the point at which the charge is at the
instant $t_0$ to the point of observation P. The retardation is determined by
the time τ needed for the perturbation to reach the point r from
the point $\mathbf{r}_0(t_0)$.

Fig. 78.1.

It is a simple matter to see that cτ and the vector $\mathbf{R}(t_0)$ form the
four-vector

$$R^\mu = (c\tau, \mathbf{R}) \tag{78.2}$$

Indeed, the four-vectors

$$x^\mu = \{ct, \mathbf{r}\} \quad \text{and} \quad x_0^\mu = \{ct_0, \mathbf{r}_0(t_0)\}$$

are two four-position vectors. The vector (78.2) equals the difference
between these two four-vectors and, consequently, is a four-vector
itself.

It can be seen from (78.1) that the square of the four-vector (78.2)
is zero:

$$\sum_\mu R^\mu R_\mu = c^2\tau^2 - R^2 = 0 \tag{78.3}$$

(this expression is in essence the square of the interval between the
events of the appearance of a signal at one point and its arrival at
another point).

Let us associate the reference frame K' with the charge. In this
frame, the charge is at rest and, consequently, the potentials are
determined by the expressions

$$\varphi'(\mathbf{r}', t') = \frac{e}{R'}, \qquad \mathbf{A}'(\mathbf{r}', t') = 0 \tag{78.4}$$

where r' and t' are the position vector of the point and the time of
observation determined in the frame K', and R' is the distance
between the point where the charge is and the point of observation.
Hence, the four-potential of the field in the frame K' is

$$A'^\mu = \left(\frac{e}{R'}, 0\right) \tag{78.5}$$


Let us attempt to find an expression for the four-potential
that would transform into expression (78.5) when v = 0. It is
evident that in an arbitrary frame, the potentials should depend on the
velocity v of the charge. Since we are seeking a four-dimensional
expression for the potential, the velocity must be taken as the
four-velocity

$$u^\mu = \left(\frac{c}{\sqrt{1 - v^2/c^2}}, \frac{\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right)$$

If we assume that the four-potential is proportional to the
four-velocity, at v = 0 the vector potential A will also be zero. Further,
the denominator of the time component of the four-potential (78.5)
contains the time component of the four-vector (78.2). Consequently,
the four-vector $R^\nu$ should be introduced into the denominator of
the required expression for $A^\mu$, and for the correct dimension to be
obtained, it must be multiplied by the four-velocity (we have already
introduced the four-velocity into the numerator).

The four-potential must thus be determined as follows:

$$A^\mu = \frac{eu^\mu}{\sum_\nu R^\nu u_\nu} \tag{78.6}$$
Substitution of values for $R^\nu$ and $u_\nu$ yields

$$\sum_\nu R^\nu u_\nu = \frac{1}{\sqrt{1 - v^2/c^2}}(c^2\tau - \mathbf{Rv})$$

Consequently, expression (78.6) can be written as

$$A^\mu = \frac{e\,(c, \mathbf{v})}{c^2\tau - \mathbf{Rv}}$$

whence

$$\varphi = \frac{ec}{c^2\tau - \mathbf{Rv}}, \qquad \mathbf{A} = \frac{e\mathbf{v}}{c^2\tau - \mathbf{Rv}} \tag{78.7}$$

It is not difficult to note that when v = 0, expression (78.7) does
indeed transform into (78.5).

Taking into account that cτ = R, let us write the expressions
for the potentials as follows:

$$\varphi(\mathbf{r}, t) = \frac{e}{R - \mathbf{R}\boldsymbol{\beta}} \tag{78.8}$$

$$\mathbf{A}(\mathbf{r}, t) = \frac{e\boldsymbol{\beta}}{R - \mathbf{R}\boldsymbol{\beta}} \tag{78.9}$$

(we have used the notation β = v/c).

Expressions (78.8) and (78.9) are known as the Lienard-Wiechert
potentials. To obtain the values of the potentials at the instant t,
the values of v and R on the right-hand sides of (78.8) and (78.9)
must be taken at the instant $t_0$ determined by the condition

$$t_0 = t - \tau = t - \frac{R(t_0)}{c} = t - \frac{|\mathbf{r} - \mathbf{r}_0(t_0)|}{c}$$

Let us write this condition as

$$F(x, y, z, t, t_0) = t - t_0 - \frac{\{[x - x_0(t_0)]^2 + [y - y_0(t_0)]^2 + [z - z_0(t_0)]^2\}^{1/2}}{c} = 0 \tag{78.10}$$

The right-hand sides of (78.8) and (78.9) are functions of $t_0$, and
the latter, in turn, is a function of x, y, z, and t (here x, y, z are
the coordinates of the point of observation):

$$t_0 = f(x, y, z, t) \tag{78.11}$$

Relation (78.10) is an implicit expression of the functional rela- 
tion (78.11). 
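The implicit relation (78.10) can be solved numerically for $t_0$. The following is a minimal sketch, assuming units with $c = 1$ and a hypothetical trajectory `uniform_motion`; the function names are illustrative and do not come from the text. It relies on the fact that $F$ decreases monotonically with $t_0$ when the charge moves slower than light, so bisection finds the single root.

```python
import math


def retarded_time(r, t, trajectory, c=1.0, tol=1e-12):
    """Solve F(t0) = t - t0 - |r - r0(t0)|/c = 0 for t0 by bisection.

    For a charge slower than light, F decreases monotonically with t0,
    so the root is unique.
    """
    def F(t0):
        return t - t0 - math.dist(r, trajectory(t0)) / c

    hi = t                 # F(t) = -R/c <= 0
    step = 1.0
    lo = t - step
    while F(lo) < 0.0:     # step back until F becomes non-negative
        step *= 2.0
        lo = t - step
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if F(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def uniform_motion(t0, v=0.5):
    """Hypothetical trajectory: uniform motion along x with speed v (c = 1)."""
    return (v * t0, 0.0, 0.0)


t0 = retarded_time((3.0, 4.0, 0.0), 2.0, uniform_motion)
print(t0)
```

For this particular trajectory the condition reduces to a quadratic equation whose retarded root is $t_0 = -14/3$, which gives a simple check of the solver.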

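When finding the values of the fields by the formulas

$$\mathbf{E} = -\nabla\varphi - \frac{1}{c}\frac{\partial \mathbf{A}}{\partial t}, \qquad \mathbf{B} = [\nabla \mathbf{A}]$$

it is necessary to calculate expressions of the form $\partial\varphi/\partial x_i$, $\partial\mathbf{A}/\partial t$,
and $\partial A_k/\partial x_i$, where $x_i$ is the coordinate of the point of observation.
Since the functions $\varphi$ and $\mathbf{A}$ depend on $x$, $y$, $z$, $t$ in a complicated
way, through $t_0 = f(x, y, z, t)$, we have to perform the calculations
according to the following scheme:

$$\frac{\partial\varphi}{\partial x_i} = \frac{\partial\varphi}{\partial t_0}\frac{\partial t_0}{\partial x_i} \tag{78.12}$$

$$\frac{\partial\mathbf{A}}{\partial t} = \frac{\partial\mathbf{A}}{\partial t_0}\frac{\partial t_0}{\partial t} \tag{78.13}$$

$$\frac{\partial A_k}{\partial x_i} = \frac{\partial A_k}{\partial t_0}\frac{\partial t_0}{\partial x_i} \tag{78.14}$$

It follows from relation (78.12) that

$$\nabla\varphi = \frac{\partial\varphi}{\partial t_0}\nabla t_0 \tag{78.15}$$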


We shall therefore need the values of the derivatives $\partial t_0/\partial x_i$ and
$\partial t_0/\partial t$, which can be found with the aid of relation (78.10). Let us
write this relation as

$$F(x_1, x_2, x_3, t, t_0) = t - t_0 - \frac{\{\sum_i [x_i - x_{0i}(t_0)]^2\}^{1/2}}{c} = t - t_0 - \frac{R}{c} = 0 \tag{78.16}$$


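According to the rules for the differentiation of an implicit
function, we have

$$\frac{\partial t_0}{\partial x_i} = -\frac{\partial F/\partial x_i}{\partial F/\partial t_0}$$

Differentiation of (78.16) with respect to $t_0$ yields

$$\frac{\partial F}{\partial t_0} = -1 + \frac{\sum_i (x_i - x_{0i})(dx_{0i}/dt_0)}{cR}$$

Taking into account that $x_i - x_{0i} = R_i$, and $dx_{0i}/dt_0$ is the $i$-th
component of the velocity of the charge at the instant $t_0$, we arrive
at the expression

$$\frac{\partial F}{\partial t_0} = -1 + \frac{\mathbf{R}\mathbf{v}(t_0)}{cR} = -\frac{R - \mathbf{R}\boldsymbol{\beta}}{R} \tag{78.17}$$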


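Similar calculations yield

$$\frac{\partial F}{\partial x_i} = -\frac{R_i}{cR} \tag{78.18}$$

Dividing (78.18) by (78.17) and reversing the sign, we obtain

$$\frac{\partial t_0}{\partial x_i} = -\frac{R_i}{c\,(R - \mathbf{R}\boldsymbol{\beta})}$$

whence it follows that

$$\nabla t_0 = -\frac{\mathbf{R}}{c\,(R - \mathbf{R}\boldsymbol{\beta})} \tag{78.19}$$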

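Now let us find $\partial t_0/\partial t$. According to the relevant rule for the
differentiation of an implicit function, we have

$$\frac{\partial t_0}{\partial t} = -\frac{\partial F/\partial t}{\partial F/\partial t_0}$$

The derivative of the function (78.16) with respect to $t$ is unity:
$\partial F/\partial t = 1$. The derivative $\partial F/\partial t_0$ is determined by expression (78.17).
Consequently,

$$\frac{\partial t_0}{\partial t} = \frac{R}{R - \mathbf{R}\boldsymbol{\beta}} \tag{78.20}$$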
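Formulas (78.19) and (78.20) can be checked by finite differences. The sketch below is only an illustration under stated assumptions: the charge is taken to move uniformly along the $x$ axis ($x_0 = v t_0$), so that the retarded time has a closed form, and units with $c = 1$ are used. The helper `t0_uniform` is a hypothetical name introduced here.

```python
import math


def t0_uniform(x, y, t, v, c=1.0):
    """Retarded time for a charge moving as x0 = v*t0 along the x axis.

    Solves c^2 (t - t0)^2 = (x - v*t0)^2 + y^2 and keeps the smaller
    (retarded) root.
    """
    a = c * c - v * v
    b = -2.0 * (c * c * t - v * x)
    k = c * c * t * t - x * x - y * y
    return (-b - math.sqrt(b * b - 4.0 * a * k)) / (2.0 * a)


x, y, t, v, h = 3.0, 4.0, 2.0, 0.5, 1e-6
t0 = t0_uniform(x, y, t, v)

Rx, Ry = x - v * t0, y            # components of R at the instant t0
R = math.hypot(Rx, Ry)
R_beta = Rx * v                   # scalar product R*beta (beta = (v, 0), c = 1)

dt0_dt_formula = R / (R - R_beta)         # (78.20)
dt0_dx_formula = -Rx / (R - R_beta)       # x component of (78.19) with c = 1

dt0_dt_numeric = (t0_uniform(x, y, t + h, v) - t0_uniform(x, y, t - h, v)) / (2 * h)
dt0_dx_numeric = (t0_uniform(x + h, y, t, v) - t0_uniform(x - h, y, t, v)) / (2 * h)

print(dt0_dt_formula, dt0_dt_numeric)
print(dt0_dx_formula, dt0_dx_numeric)
```

The analytic and numerical derivatives agree to within the accuracy of the central difference.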


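Now we shall commence calculating the values of $\partial\varphi/\partial t_0$ and $\partial\mathbf{A}/\partial t_0$.
By (78.8), we have

$$\frac{\partial\varphi}{\partial t_0} = -\frac{e}{(R - \mathbf{R}\boldsymbol{\beta})^2}\left( \frac{\partial R}{\partial t_0} - \frac{\partial\mathbf{R}}{\partial t_0}\boldsymbol{\beta} - \mathbf{R}\frac{\partial\boldsymbol{\beta}}{\partial t_0} \right)$$

We can see from the relations $R = c\,(t - t_0)$, $\mathbf{R} = \mathbf{r} - \mathbf{r}_0(t_0)$ that

$$\frac{\partial R}{\partial t_0} = -c, \qquad \frac{\partial\mathbf{R}}{\partial t_0} = -\mathbf{v}$$

In addition, $\partial\boldsymbol{\beta}/\partial t_0 = \dot{\boldsymbol{\beta}} = \dot{\mathbf{v}}/c$, where $\dot{\mathbf{v}}$ is the acceleration of the
charge at the instant $t_0$. Hence,

$$\frac{\partial\varphi}{\partial t_0} = -\frac{e\,(-c + \mathbf{v}\boldsymbol{\beta} - \mathbf{R}\dot{\boldsymbol{\beta}})}{(R - \mathbf{R}\boldsymbol{\beta})^2} = \frac{ec\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c)}{(R - \mathbf{R}\boldsymbol{\beta})^2} \tag{78.21}$$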


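By (78.9), we have

$$\frac{\partial\mathbf{A}}{\partial t_0} = \frac{\partial\varphi}{\partial t_0}\boldsymbol{\beta} + \varphi\dot{\boldsymbol{\beta}}$$

Substitution of expression (78.21) for $\partial\varphi/\partial t_0$ and (78.8) for $\varphi$ yields

$$\frac{\partial\mathbf{A}}{\partial t_0} = \frac{ec\boldsymbol{\beta}\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c)}{(R - \mathbf{R}\boldsymbol{\beta})^2} + \frac{e\dot{\boldsymbol{\beta}}}{R - \mathbf{R}\boldsymbol{\beta}} = ec\,\frac{\boldsymbol{\beta}\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c) + (\dot{\boldsymbol{\beta}}/c)(R - \mathbf{R}\boldsymbol{\beta})}{(R - \mathbf{R}\boldsymbol{\beta})^2} \tag{78.22}$$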


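Finally, we can write an expression for $\mathbf{E}$:

$$\mathbf{E} = -\nabla\varphi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} = -\frac{\partial\varphi}{\partial t_0}\nabla t_0 - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t_0}\frac{\partial t_0}{\partial t}$$

[see formulas (78.15) and (78.13)]. Introducing expressions (78.21),
(78.19), (78.22), and (78.20) into this equation, we obtain

$$\begin{aligned}
\mathbf{E} &= -\frac{ec\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c)}{(R - \mathbf{R}\boldsymbol{\beta})^2}\left[ -\frac{\mathbf{R}}{c\,(R - \mathbf{R}\boldsymbol{\beta})} \right] - e\,\frac{\boldsymbol{\beta}\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c) + (\dot{\boldsymbol{\beta}}/c)(R - \mathbf{R}\boldsymbol{\beta})}{(R - \mathbf{R}\boldsymbol{\beta})^2}\,\frac{R}{R - \mathbf{R}\boldsymbol{\beta}} \\
&= e\,\frac{\mathbf{R}\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c) - R\boldsymbol{\beta}\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c) - (R\dot{\boldsymbol{\beta}}/c)(R - \mathbf{R}\boldsymbol{\beta})}{(R - \mathbf{R}\boldsymbol{\beta})^3}
\end{aligned} \tag{78.23}$$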

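By grouping together the terms containing the acceleration $\dot{\mathbf{v}}$
of the charge (i.e. $\dot{\boldsymbol{\beta}}$), we can give the expression for $\mathbf{E}$ the form

$$\mathbf{E} = e\,\frac{(\mathbf{R} - R\boldsymbol{\beta})(1 - \beta^2)}{(R - \mathbf{R}\boldsymbol{\beta})^3} + e\,\frac{(\mathbf{R}\dot{\boldsymbol{\beta}}/c)(\mathbf{R} - R\boldsymbol{\beta}) - (R\dot{\boldsymbol{\beta}}/c)(R - \mathbf{R}\boldsymbol{\beta})}{(R - \mathbf{R}\boldsymbol{\beta})^3}$$

The numerator of the second term can be written as the vector triple
product $[\mathbf{R}, [(\mathbf{R} - R\boldsymbol{\beta}), \dot{\boldsymbol{\beta}}/c]]$. This can be verified by expanding
the product according to formula (VI.5). As a result, we obtain the
final expression

$$\mathbf{E} = e\,\frac{(\mathbf{R} - R\boldsymbol{\beta})(1 - \beta^2)}{(R - \mathbf{R}\boldsymbol{\beta})^3} + e\,\frac{[\mathbf{R}, [(\mathbf{R} - R\boldsymbol{\beta}), \dot{\boldsymbol{\beta}}/c]]}{(R - \mathbf{R}\boldsymbol{\beta})^3} \tag{78.24}$$

Remember that the values of $\mathbf{R}$, $\boldsymbol{\beta}$ (i.e. $\mathbf{v}$), and $\dot{\boldsymbol{\beta}}$ (i.e. $\dot{\mathbf{v}}$) in this
formula must be taken for the instant $t_0 = t - \tau$.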

The field described by expression (78.24) consists of two parts.
The first depends only on the velocity of the charge and at large
distances diminishes as $1/R^2$ (i.e. as a Coulomb field). The second part,
besides the velocity of the charge, also depends on its acceleration
and at large distances diminishes as $1/R$ (i.e. as the field strength in
a spherical electromagnetic wave).
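The two parts of (78.24) can be evaluated separately. The sketch below is an illustration only: it assumes Gaussian units, takes all quantities at the retarded instant, and passes $\dot{\boldsymbol{\beta}}/c$ as a single argument. The name `field_parts` is hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def field_parts(e, R, beta, beta_dot_over_c):
    """Return the velocity and acceleration parts of E according to (78.24)."""
    R_mag = math.sqrt(dot(R, R))
    s = R_mag - dot(R, beta)                               # R - (R beta), a scalar
    u = tuple(Ri - R_mag * bi for Ri, bi in zip(R, beta))  # vector R - R*beta
    velocity_part = tuple(e * ui * (1.0 - dot(beta, beta)) / s**3 for ui in u)
    triple = cross(R, cross(u, beta_dot_over_c))           # [R, [(R - R beta), beta_dot/c]]
    acceleration_part = tuple(e * wi / s**3 for wi in triple)
    return velocity_part, acceleration_part


# Charge at rest: only the Coulomb field e*R/R^3 remains.
coulomb, zero = field_parts(1.0, (0.0, 0.0, 2.0), (0.0, 0.0, 0.0), (0.0, 0.0, 0.0))

# Doubling the distance with the same beta and beta_dot: the first part
# falls by a factor of 4 (1/R^2), the second by a factor of 2 (1/R).
near = field_parts(1.0, (1.0, 2.0, 2.0), (0.3, 0.0, 0.0), (0.0, 0.1, 0.0))
far = field_parts(1.0, (2.0, 4.0, 4.0), (0.3, 0.0, 0.0), (0.0, 0.1, 0.0))
```

Because the acceleration part is a vector product with $\mathbf{R}$ as its first factor, it is always perpendicular to $\mathbf{R}$, as expected for a radiation field.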

If the charge moves uniformly, the second term in formula (78.24)
vanishes, and the field is determined by the expression

$$\mathbf{E} = e\,\frac{(\mathbf{R}_0 - R_0\boldsymbol{\beta})(1 - \beta^2)}{(R_0 - \mathbf{R}_0\boldsymbol{\beta})^3} \tag{78.25}$$

Here the subscript “0” on $\mathbf{R}$ stresses the circumstance that the value
of $\mathbf{R}$ is taken for the instant $t_0$. We have not used the subscript “0”
on $\boldsymbol{\beta}$ because in uniform motion $\boldsymbol{\beta}$ does not depend on $t$.

In Sec. 77 we obtained formula (77.17) for the field of a uniformly 
moving charge. It can be written as 


$$\mathbf{E} = e\,\frac{\mathbf{R}\,(1 - \beta^2)}{R^3\,(1 - \beta^2\sin^2\vartheta)^{3/2}} \tag{78.26}$$


Formulas (78.25) and (78.26) greatly differ from each other in their
appearance. It is a simple matter to show, however, that they are
actually identical. The point is that in formula (78.25) the field
is expressed in terms of the distance from the charge to the point of
observation taken at the instant $t_0 = t - \tau$. In formula (78.26),
however, the field is expressed in terms of the distance from the charge
to the point of observation taken at the instant of observation $t$.

To prove that formulas (78.25) and (78.26) are identical, let us
consider Fig. 78.2. The distance $OP$ is $R_0$, $eP$ equals $R$, and the
segment $Oe$ is the path travelled by the charge during the retarded time $\tau$.
Taking into account that $\tau = R_0/c$, this path can be written as $R_0\beta$.



The vectors $\mathbf{R}_0$, $\mathbf{R}$, and $R_0\boldsymbol{\beta}$ are related by the expression
$\mathbf{R} = \mathbf{R}_0 - R_0\boldsymbol{\beta}$, whence the identity of the numerators of formulas
(78.25) and (78.26) follows.

The length of the segment $OQ$ equals the projection of the vector
$R_0\boldsymbol{\beta}$ onto the direction of the vector $\mathbf{R}_0$:

$$OQ = R_0\beta\cos\alpha = \mathbf{R}_0\boldsymbol{\beta}$$

Consequently,

$$QP = R_0 - \mathbf{R}_0\boldsymbol{\beta} \tag{78.27}$$

[compare with the denominator of formula (78.25)].

Let us express the length of the segment $QP$ in terms of $R$. A
glance at Fig. 78.2 shows that

$$(QP)^2 = R^2 - b^2 = R^2 - (R_0\beta\sin\alpha)^2$$

Further, it also follows from the figure that $R_0\sin\alpha = R\sin\vartheta$,
so that

$$(QP)^2 = R^2 - (R\beta\sin\vartheta)^2$$

whence

$$QP = R\sqrt{1 - \beta^2\sin^2\vartheta} \tag{78.28}$$

Finally, equating the right-hand sides of expressions (78.27) and
(78.28), we arrive at the relation

$$R_0 - \mathbf{R}_0\boldsymbol{\beta} = R\sqrt{1 - \beta^2\sin^2\vartheta}$$

from which the identity of the denominators of formulas (78.25)
and (78.26) follows. We have thus proved the identity of expres- 
sions (78.25) and (78.26). 
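The identity can also be confirmed numerically. The sketch below is a hedged illustration: it assumes Gaussian units, places the retarded position of the charge at the origin, and obtains the present-time vector from $\mathbf{R} = \mathbf{R}_0 - R_0\boldsymbol{\beta}$. The function names are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def field_retarded(e, R0, beta):
    """Field (78.25), expressed through the retarded vector R0."""
    R0_mag = math.sqrt(dot(R0, R0))
    s = R0_mag - dot(R0, beta)
    b2 = dot(beta, beta)
    return tuple(e * (x - R0_mag * b) * (1.0 - b2) / s**3 for x, b in zip(R0, beta))


def field_present(e, R, beta):
    """Field (78.26), expressed through the vector R at the instant of observation."""
    R2 = dot(R, R)
    b2 = dot(beta, beta)
    # beta^2 sin^2(theta) = beta^2 - (R beta)^2 / R^2, theta being the angle between R and beta
    b2_sin2 = b2 - dot(R, beta) ** 2 / R2
    return tuple(e * x * (1.0 - b2) / (R2 ** 1.5 * (1.0 - b2_sin2) ** 1.5) for x in R)


R0 = (3.0, 4.0, 0.0)
beta = (0.5, 0.0, 0.0)
R0_mag = math.sqrt(dot(R0, R0))
R = tuple(x - R0_mag * b for x, b in zip(R0, beta))   # R = R0 - R0*beta

E_retarded = field_retarded(1.0, R0, beta)
E_present = field_present(1.0, R, beta)
print(E_retarded, E_present)
```

For these values $R_0 - \mathbf{R}_0\boldsymbol{\beta} = 3.5$ and $R\sqrt{1 - \beta^2\sin^2\vartheta} = 3.5$ as well, so both functions return the same vector.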

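Let us pass over to finding the field $\mathbf{B}$. By formula (XI.56), we have

$$\mathbf{B} = [\nabla\mathbf{A}] = \left[ \nabla t_0,\ \frac{\partial\mathbf{A}}{\partial t_0} \right]$$

($\mathbf{A}$ depends on the coordinates through $t_0$, in the same way as the
vector $\mathbf{a}$ in formula (XI.56) depends on the coordinates through its
scalar argument).

Introducing (78.19) and (78.22) into the last expression, we obtain

$$\mathbf{B} = \left[ -\frac{\mathbf{R}}{c\,(R - \mathbf{R}\boldsymbol{\beta})},\ ec\,\frac{\boldsymbol{\beta}\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c) + (\dot{\boldsymbol{\beta}}/c)(R - \mathbf{R}\boldsymbol{\beta})}{(R - \mathbf{R}\boldsymbol{\beta})^2} \right] = \left[ \frac{\mathbf{R}}{R},\ e\,\frac{-R\boldsymbol{\beta}\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c) - (R\dot{\boldsymbol{\beta}}/c)(R - \mathbf{R}\boldsymbol{\beta})}{(R - \mathbf{R}\boldsymbol{\beta})^3} \right]$$

Let us add in the numerator of the second factor the term
$\mathbf{R}\,(1 - \beta^2 + \mathbf{R}\dot{\boldsymbol{\beta}}/c)$. This will not change the expression because
$[\mathbf{R}\mathbf{R}] = 0$. But the second factor will now transform into $\mathbf{E}$ [see (78.23)].
We thus arrive at the relation

$$\mathbf{B} = \left[ \frac{\mathbf{R}}{R},\ \mathbf{E} \right] \tag{78.29}$$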


Here R = R ( t 0 ). Examination of (78.29) shows that the vector B 
at each point is perpendicular to the vector E and to the vector drawn 
from the point where the charge was at the instant t 0 to the point of 
observation. 


79. Field Produced by a System of Charges 
at Great Distances 

Assume that we have a system of moving charges that do not leave
the confines of a certain volume in their motion. We shall presume
that the system as a whole is neutral. Let us consider the field
produced by such a system at distances that are great in comparison with
its dimensions. We shall place the origin of coordinates inside the
system and characterize the distribution of the charge with the aid
of the function $\rho = \rho(\mathbf{r}', t)$. Hence, the charge inside the volume
$dV'$ at the point with the position vector $\mathbf{r}'$ will be
$de(t) = \rho(\mathbf{r}', t)\,dV'$. Let $\mathbf{r}$ stand for the position vector of the observation
point $P$. In addition, we shall introduce the notation $\mathbf{R} = \mathbf{r} - \mathbf{r}'$. It
is obvious that $\mathbf{R}$ is the vector drawn from $de$ to the point $P$.

Let us write expressions (76.11) and (76.12) for the retarded
potentials of the field produced by the system:

$$\varphi(\mathbf{r}, t) = \int \frac{\rho(\mathbf{r}',\ t - R/c)}{R}\,dV' \tag{79.1}$$

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{c}\int \frac{\mathbf{j}(\mathbf{r}',\ t - R/c)}{R}\,dV' \tag{79.2}$$

According to our assumption, $r \gg r'$. Therefore, the quantity
$R = |\mathbf{r} - \mathbf{r}'|$ can be considered as the value of the function
$f(\mathbf{r}) = |\mathbf{r}| = r$ at the point $\mathbf{r} + \delta\mathbf{r}$, where $\delta\mathbf{r} = -\mathbf{r}'$. Taking
advantage of the formula $f(\mathbf{r} + \delta\mathbf{r}) = f(\mathbf{r}) + \nabla f(\mathbf{r})\,\delta\mathbf{r}$, we can write

$$R = |\mathbf{r} - \mathbf{r}'| = r + \nabla r\,(-\mathbf{r}') = r - \frac{\mathbf{r}}{r}\mathbf{r}' = r - \mathbf{n}\mathbf{r}' \tag{79.3}$$


where n is the unit vector of the position vector r. 
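The quality of approximation (79.3) is easy to probe numerically. The following sketch uses arbitrary illustrative numbers (not taken from the text) and compares the exact distance $|\mathbf{r} - \mathbf{r}'|$ with $r - \mathbf{n}\mathbf{r}'$ for $r \gg r'$.

```python
import math


def exact_distance(r, r_prime):
    """|r - r'| computed directly."""
    return math.dist(r, r_prime)


def far_zone_distance(r, r_prime):
    """First-order approximation (79.3): R = r - n r'."""
    r_mag = math.hypot(*r)
    n = [x / r_mag for x in r]
    return r_mag - sum(ni * xi for ni, xi in zip(n, r_prime))


r = (1000.0, 0.0, 0.0)        # observation point, far from the system
r_prime = (0.3, 0.4, 0.0)     # point inside the system

error = exact_distance(r, r_prime) - far_zone_distance(r, r_prime)
print(error)
```

The discarded remainder is of second order in $r'/r$ (here roughly $r'^2_\perp/2r \approx 8 \times 10^{-5}$), which is why it may be neglected in the denominators of the potentials.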

Substitution of (79.3) for $R$ in the formulas for the potentials
yields

$$\varphi(\mathbf{r}, t) = \int \frac{\rho(\mathbf{r}',\ t - r/c + \mathbf{n}\mathbf{r}'/c)}{r - \mathbf{n}\mathbf{r}'}\,dV' \tag{79.4}$$

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{c}\int \frac{\mathbf{j}(\mathbf{r}',\ t - r/c + \mathbf{n}\mathbf{r}'/c)}{r - \mathbf{n}\mathbf{r}'}\,dV' \tag{79.5}$$

We see that the retarded time $\tau$ consists of two parts. One of them,
equal to $\tau_0 = r/c$, does not depend on $\mathbf{r}'$ and is called the retarded
time of the system. It determines the time needed for an
electromagnetic perturbation to travel the path from the origin of
coordinates to the point of observation. The second part, equal to
$\tau' = -\mathbf{n}\mathbf{r}'/c$, is called the proper retardation. It characterizes the
time needed for propagation of the perturbation within the limits of
the system.

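Let us expand the integrand in (79.4) into a series in the ratio
$r'/r$. Considering the quantity $-\mathbf{n}\mathbf{r}'$ as the small increment $\delta r$
of the argument $r$, we obtain

$$\frac{\rho\left(\mathbf{r}',\ t - \dfrac{r - \mathbf{n}\mathbf{r}'}{c}\right)}{r - \mathbf{n}\mathbf{r}'} = \frac{\rho(\mathbf{r}',\ t - r/c)}{r} + \frac{\partial}{\partial r}\left[ \frac{\rho(\mathbf{r}',\ t - r/c)}{r} \right](-\mathbf{n}\mathbf{r}') + \ldots \tag{79.6}$$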


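To see whether we can limit ourselves to the written terms in the
expansion, we must assess the following terms. The latter will
contain higher derivatives of $\rho$ with respect to $r$. It is easy to see that
the derivatives of $\rho$ with respect to $r$ are proportional to the
derivatives of $\rho$ with respect to $t$. Indeed, assuming that $t - r/c = \xi$, we
can write

$$\frac{\partial\rho}{\partial r} = \frac{\partial\rho}{\partial\xi}\frac{\partial\xi}{\partial r} = -\frac{1}{c}\frac{\partial\rho}{\partial\xi}, \qquad \frac{\partial\rho}{\partial t} = \frac{\partial\rho}{\partial\xi}\frac{\partial\xi}{\partial t} = \frac{\partial\rho}{\partial\xi}$$

whence

$$\frac{\partial\rho}{\partial r} = -\frac{1}{c}\frac{\partial\rho}{\partial t} \tag{79.7}$$

Similarly

$$\frac{\partial^2\rho}{\partial r^2} = \frac{1}{c^2}\frac{\partial^2\rho}{\partial t^2}$$

The consecutive differentiation of the function $\rho/r$ with respect
to $r$ with the following replacement of the derivatives with respect
to $r$ with those with respect to $t$ yields

$$\frac{\partial}{\partial r}\left( \frac{\rho}{r} \right) = -\frac{\rho}{r^2} + \frac{1}{r}\left( -\frac{1}{c} \right)\frac{\partial\rho}{\partial t}$$

$$\frac{\partial^2}{\partial r^2}\left( \frac{\rho}{r} \right) = \frac{2\rho}{r^3} - \frac{2}{r^2}\left( -\frac{1}{c} \right)\frac{\partial\rho}{\partial t} + \frac{1}{r}\left( -\frac{1}{c} \right)^2\frac{\partial^2\rho}{\partial t^2}$$

$$\frac{\partial^m}{\partial r^m}\left( \frac{\rho}{r} \right) = \ldots + \frac{1}{r}\left( -\frac{1}{c} \right)^m\frac{\partial^m\rho}{\partial t^m}$$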

(in the last line we have written only the last term). 

At large values of r, the first terms in the derivatives we have writ- 
ten are much smaller than the last ones. Therefore, our task con- 
sists in assessing the relative magnitude of expressions of the form 

$$\frac{1}{r}\left( -\frac{1}{c} \right)^m\frac{\partial^m\rho}{\partial t^m}\,(-\mathbf{n}\mathbf{r}')^m = \frac{1}{r}\left( \frac{1}{c} \right)^m\frac{\partial^m\rho}{\partial t^m}\,(\mathbf{n}\mathbf{r}')^m \tag{79.8}$$

[in expansion (79.6), the $m$-th derivative is multiplied by $(\delta r)^m$].

Assume that $\rho$ varies with time according to the harmonic law
$\rho \sim \cos\omega t$. Then the $m$-th derivative of $\rho$ with respect to $t$
will be of the order of $\omega^m\rho$. Substitution into (79.8) yields

$$\frac{\rho}{r}\left( \frac{\omega l}{c} \right)^m$$

where $l$ are the linear dimensions of the system of charges being
considered.

The following term of the expansion will be of the order

$$\frac{\rho}{r}\left( \frac{\omega l}{c} \right)^{m+1}$$

Thus, the ratio of the consecutive terms of expansion (79.6) is of
the order of $\omega l/c$. Replacing the frequency with the
period of the variation of $\rho$ (by the formula $\omega = 2\pi/T$), we obtain

$$\frac{\omega l}{c} = \frac{2\pi l}{cT} \sim \frac{l}{cT}$$

What has been said above shows that the subsequent terms in the
expansion (79.6) may be disregarded if the condition

$$\frac{l}{cT} \ll 1 \tag{79.9}$$

is satisfied. The ratio $l/c$ determines the proper retardation time $\tau'$.
Consequently, condition (79.9) can be written as

$$\tau' \ll T \tag{79.10}$$

It can be seen from (79.10) that we may limit ourselves to the first
terms of expansion (79.6) when the time needed for the propagation of
an electromagnetic perturbation within the limits of the system is
much smaller than the time during which the distribution of the
charges in the system changes appreciably.

Condition (79.9) can be written in two other ways. The product $cT$
gives the wavelength $\lambda$ of the radiation produced by the system.
Therefore, inequality (79.9) can be written as

$$l \ll \lambda \tag{79.11}$$

(the dimensions of the system must be much smaller than the
wavelength).

Finally, having in view that $l/T$ in the order of magnitude equals
the velocity $v$ of the charges in the system, instead of inequality
(79.9) we can write

$$v \ll c \tag{79.12}$$

A glance at the last relation shows that by interrupting expansion 
(79.6) at the second term, we have limited ourselves to considering 
the radiation of a non-relativistic system of charges. 
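The equivalence of conditions (79.9)-(79.12) can be made concrete with numbers. The sketch below is only an order-of-magnitude illustration: the values $l \sim 10^{-8}$ cm and $T \sim 2 \times 10^{-15}$ s (an atom-sized emitter of visible light) are assumptions chosen for the example, not data from the text.

```python
import math

C = 2.998e10  # speed of light in cm/s


def dipole_approximation_ratios(l, T, c=C):
    """Return the four equivalent small parameters of (79.9)-(79.12)."""
    tau_proper = l / c      # proper retardation time, order of magnitude
    wavelength = c * T      # wavelength of the emitted radiation
    v = l / T               # characteristic speed of the charges
    return {
        "l/(cT)": l / (c * T),
        "tau'/T": tau_proper / T,
        "l/lambda": l / wavelength,
        "v/c": v / c,
    }


# Hypothetical atom-sized emitter of visible light.
ratios = dipole_approximation_ratios(l=1e-8, T=2e-15)
for name, value in ratios.items():
    print(name, value)
```

All four ratios coincide (about $1.7 \times 10^{-4}$ for these numbers), so any one of the inequalities may be used to decide whether the dipole approximation applies.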

Let us again turn to the calculation of the potentials, assuming 
conditions (79.10)-(79.12) to be satisfied. The substitution of (79.6) 
into (79.4) yields 

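$$\varphi(\mathbf{r}, t) = \frac{1}{r}\int \rho\left(\mathbf{r}',\ t - \frac{r}{c}\right)dV' - \frac{\partial}{\partial r}\left\{ \frac{1}{r}\int \rho\left(\mathbf{r}',\ t - \frac{r}{c}\right)\mathbf{n}\mathbf{r}'\,dV' \right\} \tag{79.13}$$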

(we remind our reader that integration is performed over the primed 
coordinates, therefore r can be put outside the integral; in addition, 
we have changed the sequence of differentiation with respect to r 
and of integration). 

The density of the charge at the instant $t - r/c$ is inside the
first integral. Consequently, this integral gives the total charge of
the system, which owing to the presumed electroneutrality of the
system is zero. We must therefore retain only the second term in
formula (79.13). Putting $\mathbf{n}$ in it outside the integral and the
derivative, we arrive at the expression

$$\varphi(\mathbf{r}, t) = -\mathbf{n}\,\frac{\partial}{\partial r}\left\{ \frac{1}{r}\int \rho\left(\mathbf{r}',\ t - \frac{r}{c}\right)\mathbf{r}'\,dV' \right\}$$

The integral in this expression is the dipole electric moment which
the system had at the instant $t - r/c$:

$$\mathbf{p}\left(t - \frac{r}{c}\right) = \int \rho\left(\mathbf{r}',\ t - \frac{r}{c}\right)\mathbf{r}'\,dV'$$

[compare with (43.6)]. We can therefore write

$$\varphi(\mathbf{r}, t) = -\mathbf{n}\,\frac{\partial}{\partial r}\left[ \frac{\mathbf{p}(t - r/c)}{r} \right] \tag{79.14}$$

Finally, having performed differentiation and taken into consideration
that $\partial\mathbf{p}/\partial r = -(1/c)\,\partial\mathbf{p}/\partial t = -(1/c)\,\dot{\mathbf{p}}$ [compare with (79.7)],
we obtain

$$\varphi(\mathbf{r}, t) = \frac{\mathbf{n}\mathbf{p}(t - r/c)}{r^2} + \frac{\mathbf{n}\dot{\mathbf{p}}(t - r/c)}{cr}$$

The first term in this formula coincides with the potential (43.9)
of a static dipole ($\mathbf{n} = \mathbf{r}/r$). We must note that the field corresponding
to this term at the distance $r$ and the instant $t$ is determined by the
value of the dipole moment at the instant $t - r/c$. The first term
diminishes with an increasing distance $r$ much more rapidly than the
second term. Therefore, considering the field at great distances, we
can assume that

$$\varphi(\mathbf{r}, t) = \frac{\mathbf{n}\dot{\mathbf{p}}(t - r/c)}{cr} \tag{79.15}$$

Let us go over to determination of the vector potential. Formula
(79.2) differs from (79.1) only in that $\mathbf{j}(\mathbf{r}', t - R/c)$ is inside the
integral instead of $\rho(\mathbf{r}', t - R/c)$. Consequently, by expanding the
integrand into a series, we obtain an expression similar to (79.13):

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{cr}\int \mathbf{j}\left(\mathbf{r}',\ t - \frac{r}{c}\right)dV' - \frac{1}{c}\frac{\partial}{\partial r}\left\{ \frac{1}{r}\int \mathbf{j}\left(\mathbf{r}',\ t - \frac{r}{c}\right)(\mathbf{n}\mathbf{r}')\,dV' \right\} \tag{79.16}$$

If the currents were stationary, i.e. did not depend on $t$, the first
integral would vanish [see (51.5)]. For non-stationary currents,
however, this integral differs from zero. We may therefore retain only the
first term in expansion (79.16)¹. We can thus assume that

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{cr}\int \mathbf{j}\left(\mathbf{r}',\ t - \frac{r}{c}\right)dV'$$

¹ In formula (79.13), we could not disregard the second term because the
first one vanished.

We shall prove that $\int \mathbf{j}(\mathbf{r}', t - r/c)\,dV'$ equals the time
derivative of the dipole moment of the system taken for the instant $t - r/c$.
It will be the simplest to prove this by passing from the continuous
distribution of the charges to a discrete one. Let us perform the
substitution

$$\int \mathbf{j}\,dV' = \int \rho\mathbf{v}\,dV' \to \sum_a e_a\mathbf{v}_a$$

(the velocities of the charges, like the function $\rho\mathbf{v}$, must be taken
for the instant $t - r/c$). However,

$$\sum_a e_a\mathbf{v}_a = \sum_a e_a\dot{\mathbf{r}}'_a = \frac{d}{dt}\sum_a e_a\mathbf{r}'_a = \dot{\mathbf{p}}(t - r/c)$$

Consequently,

$$\mathbf{A}(\mathbf{r}, t) = \frac{\dot{\mathbf{p}}(t - r/c)}{cr} \tag{79.17}$$

A comparison with (79.15) allows us to write

$$\varphi = \mathbf{A}\mathbf{n} \tag{79.18}$$

The potentials (79.15) and (79.17) are determined by the value 
of the time derivative of the dipole moment of the system. This is 
why they are called potentials calculated in a dipole approximation. 
The dipole approximation is allowable when the conditions (79.10)- 
(79.12) are observed. 


80. Dipole Radiation


The region of a field that is at a distance $r$ from the radiating system
much greater not only than the dimensions of the system $l$, but also
than the radiated wavelength ($r \gg \lambda \gg l$) is known as a wave zone.

In this zone, conditions are observed in which the dipole
approximation treated in Sec. 79 holds. In this approximation

$$\varphi = \frac{\mathbf{n}\dot{\mathbf{p}}(t - r/c)}{cr}, \qquad \mathbf{A} = \frac{\dot{\mathbf{p}}(t - r/c)}{cr} \tag{80.1}$$

[see (79.15) and (79.17)].

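To calculate $\mathbf{E}$, we must find $\nabla\varphi$ and $\partial\mathbf{A}/\partial t$. Using formula (XI.51),
we obtain

$$\nabla\varphi = \frac{\partial\varphi}{\partial r}\nabla r = \frac{\partial\varphi}{\partial r}\,\mathbf{n}$$

(recall that $\mathbf{n} = \mathbf{e}_r = \mathbf{r}/r$). Hence,

$$\nabla\varphi = \frac{\partial}{\partial r}\left( \frac{\mathbf{n}\dot{\mathbf{p}}(t - r/c)}{cr} \right)\mathbf{n} = -\frac{\mathbf{n}\dot{\mathbf{p}}}{cr^2}\,\mathbf{n} - \frac{\mathbf{n}\ddot{\mathbf{p}}}{c^2 r}\,\mathbf{n} \tag{80.2}$$

[we have taken advantage of the circumstance that
$\partial\dot{\mathbf{p}}/\partial r = -(1/c)\,\partial\dot{\mathbf{p}}/\partial t$].

The first term in the expression we have obtained diminishes with
an increasing distance much more rapidly than the second one. It
may therefore be ignored for great distances, and we may consider
that

$$\nabla\varphi \approx -\frac{\mathbf{n}\ddot{\mathbf{p}}}{c^2 r}\,\mathbf{n}$$

The derivative $\partial\mathbf{A}/\partial t = \ddot{\mathbf{p}}/cr$. Hence,

$$\mathbf{E} = -\nabla\varphi - \frac{1}{c}\frac{\partial\mathbf{A}}{\partial t} = \frac{(\mathbf{n}\ddot{\mathbf{p}})\,\mathbf{n}}{c^2 r} - \frac{\ddot{\mathbf{p}}}{c^2 r} = \frac{(\mathbf{n}\ddot{\mathbf{p}})\,\mathbf{n} - \ddot{\mathbf{p}}}{c^2 r}$$

The numerator of this expression can be written as $[\mathbf{n}, [\mathbf{n}\ddot{\mathbf{p}}]]$. This
can readily be verified by expanding the vector triple product using
formula (VI.5) and taking into account that $\mathbf{n}\mathbf{n} = 1$. The electric
field is thus determined by the formula

$$\mathbf{E} = \frac{1}{c^2 r}\,[\mathbf{n}, [\mathbf{n}\ddot{\mathbf{p}}]] = \frac{1}{c^2 r}\,[[\ddot{\mathbf{p}}\mathbf{n}], \mathbf{n}]$$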

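Let us go over to calculation of the magnetic field. The vector
potential is a function of $r$. Therefore, by (XI.56), we have

$$\mathbf{B} = [\nabla\mathbf{A}] = \left[ \nabla r,\ \frac{\partial\mathbf{A}}{\partial r} \right] = \left[ \mathbf{n},\ \frac{\partial\mathbf{A}}{\partial r} \right] \tag{80.3}$$

Differentiation of expression (80.1) for $\mathbf{A}$ yields

$$\frac{\partial\mathbf{A}}{\partial r} = \frac{\partial}{\partial r}\left( \frac{\dot{\mathbf{p}}(t - r/c)}{cr} \right) = -\frac{\dot{\mathbf{p}}}{cr^2} - \frac{\ddot{\mathbf{p}}}{c^2 r}$$

[compare with (80.2)]. Discarding the term proportional to $1/r^2$, we
find that $\partial\mathbf{A}/\partial r = -\ddot{\mathbf{p}}/c^2 r$. Hence,

$$\mathbf{B} = -\frac{1}{c^2 r}\,[\mathbf{n}\ddot{\mathbf{p}}] = \frac{1}{c^2 r}\,[\ddot{\mathbf{p}}\mathbf{n}]$$

We shall write the final expressions for $\mathbf{E}$ and $\mathbf{B}$:

$$\mathbf{E} = \frac{1}{c^2 r}\,[[\ddot{\mathbf{p}}\mathbf{n}], \mathbf{n}], \qquad \mathbf{B} = \frac{1}{c^2 r}\,[\ddot{\mathbf{p}}\mathbf{n}] \tag{80.4}$$

(remember that the values of $\ddot{\mathbf{p}}$ must be taken for the instant $t - r/c$).
A comparison of these expressions leads to the conclusion that

$$\mathbf{E} = [\mathbf{B}\mathbf{n}] \tag{80.5}$$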

whence it follows that the vector $\mathbf{E}$ is perpendicular to the vector $\mathbf{B}$.
Examination of expressions (80.4) shows that the vectors $\mathbf{E}$ and $\mathbf{B}$
are perpendicular to the vector $\mathbf{n}$ [the perpendicularity of $\mathbf{E}$ to $\mathbf{n}$
can also be seen from (80.5)]. Hence, as in a plane wave, the vectors $\mathbf{B}$,
$\mathbf{E}$, and $\mathbf{n}$ are mutually perpendicular [see formula (72.16)¹]. In
addition, the vectors $\mathbf{B}$ and $\mathbf{E}$, as in a plane wave, are identical in
magnitude, and

$$E = B = \frac{\ddot{p}\sin\vartheta}{c^2 r} \tag{80.6}$$

where $\vartheta$ is the angle between the directions of the vectors $\ddot{\mathbf{p}}$ and $\mathbf{n}$.
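The geometric statements about the wave-zone fields can be verified with a few lines of vector algebra. The sketch below is an illustration under assumed values ($c = 1$, $r = 2$, $\ddot{\mathbf{p}}$ along the $z$ axis, $\vartheta = \pi/3$); the function name `dipole_fields` is hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dipole_fields(p_ddot, n, r, c=1.0):
    """Wave-zone fields (80.4); p_ddot is the second derivative of p at t - r/c."""
    k = 1.0 / (c * c * r)
    pn = cross(p_ddot, n)                    # [p_ddot n]
    B = tuple(k * x for x in pn)
    E = tuple(k * x for x in cross(pn, n))   # [[p_ddot n], n]
    return E, B


theta = math.pi / 3
n = (math.sin(theta), 0.0, math.cos(theta))
E, B = dipole_fields((0.0, 0.0, 1.0), n, r=2.0)

E_mag = math.sqrt(dot(E, E))
B_mag = math.sqrt(dot(B, B))
print(E, B, E_mag, B_mag)
```

With these inputs $\mathbf{E}$, $\mathbf{B}$, and $\mathbf{n}$ come out mutually perpendicular, $\mathbf{E} = [\mathbf{B}\mathbf{n}]$ holds, and both magnitudes equal $\ddot{p}\sin\vartheta/c^2 r$.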

That the relations observed for a plane wave were found to hold
for the field we are studying is not surprising. At distances that are
great in comparison with the dimensions of the radiating system, a
wave must be spherical. At the same time, provided that $r \gg \lambda$, small
portions of the spherical wave virtually coincide with a plane wave.

It can be seen from (80.4) that the fields $\mathbf{E}$ and $\mathbf{B}$ are determined
by the second derivative of the dipole moment of the system.
This is the reason why the radiation being considered is called
dipole radiation.

The dipole moment is determined by the expression $\mathbf{p} = \sum e\mathbf{r}'$.
Consequently, $\ddot{\mathbf{p}} = \sum e\ddot{\mathbf{r}}' = \sum e\dot{\mathbf{v}}$. It thus follows that charges emit
electromagnetic waves only when they move with acceleration.

To comprehend the pattern of the field at great distances, let us
introduce a spherical system of coordinates, measuring the polar
angle $\vartheta$ from the direction of the vector $\ddot{\mathbf{p}}(t - r/c)$ (Fig. 80.1). By
(80.4), the vector $\mathbf{B}$ is perpendicular to the plane determined by the
vectors $\ddot{\mathbf{p}}$ and $\mathbf{n}$. Consequently, $\mathbf{B}$ is directed along a tangent to a
“parallel”, the vectors $\ddot{\mathbf{p}}$, $\mathbf{n}$, and $\mathbf{B}$ forming a right-handed system.
Examination of (80.5) shows that the vectors $\mathbf{B}$, $\mathbf{n}$, and $\mathbf{E}$ form a
right-handed system. Hence, it follows that $\mathbf{E}$ is directed along a
tangent to a “meridian”, the directions of $\ddot{\mathbf{p}}$ and $\mathbf{E}$ on the equator
being opposite. We stress once more that the vectors depicted in
Fig. 80.1 relate to different instants: $\ddot{\mathbf{p}}$ to the instant $t - r/c$, and $\mathbf{B}$
and $\mathbf{E}$ to the instant $t$. The magnitudes of the vectors $\mathbf{B}$ and $\mathbf{E}$ are
proportional to $\sin\vartheta$ [see (80.4)]. Therefore, the fields have the
maximum value at the “equator” and vanish at the “poles”.

¹ We multiply relation (80.5) by $\mathbf{n}$ and use formula (VI.5):

$$[\mathbf{n}\mathbf{E}] = [\mathbf{n}, [\mathbf{B}\mathbf{n}]] = \mathbf{B}\,(\mathbf{n}\mathbf{n}) - \mathbf{n}\,(\mathbf{n}\mathbf{B}) = \mathbf{B}$$

We have arrived at formula (72.16) (in a vacuum $\sqrt{\varepsilon\mu} = 1$, $\mathbf{H} = \mathbf{B}$).

To determine the intensity of radiation in different directions and
the total radiated power, let us calculate the Poynting vector. With
a view to (80.6), we obtain

$$\mathbf{S} = \frac{c}{4\pi}\,[\mathbf{E}\mathbf{B}] = \frac{\ddot{p}^2\sin^2\vartheta}{4\pi c^3 r^2}\,\mathbf{n} \tag{80.7}$$

Hence, the intensity of dipole radiation is proportional to $\sin^2\vartheta$.
The diagram showing the intensity as a function of $\vartheta$ has a two-lobe
pattern.

To determine the radiated power $P$, let us find the energy flux
through the entire spherical surface. The area of a spherical band of
width $d\vartheta$ is $2\pi r^2\sin\vartheta\,d\vartheta$. Consequently,

$$P = \int S\,df = \int_0^\pi \frac{\ddot{p}^2\sin^2\vartheta}{4\pi c^3 r^2}\,2\pi r^2\sin\vartheta\,d\vartheta = \frac{2\ddot{p}^2}{3c^3} \tag{80.8}$$

Assume that of all the charges of the system only one has
acceleration. Hence, $\ddot{\mathbf{p}} = \sum e_a\dot{\mathbf{v}}_a = e\dot{\mathbf{v}}$ and the radiated power is

$$P = \frac{2e^2\dot{v}^2}{3c^3} \tag{80.9}$$

This formula also holds when there is only one charge moving with
the acceleration $\dot{\mathbf{v}}$.
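The angular integration leading to (80.8) can be reproduced numerically. The following sketch is an illustration with arbitrary values of $\ddot{p}$ and $r$; it integrates the magnitude of the Poynting vector (80.7) over spherical bands with the midpoint rule. The function name is hypothetical.

```python
import math


def radiated_power(p_ddot, c=1.0, r=1.0, steps=2000):
    """Integrate the Poynting flux (80.7) over a sphere of radius r."""
    d_theta = math.pi / steps
    total = 0.0
    for k in range(steps):
        theta = (k + 0.5) * d_theta
        S = p_ddot**2 * math.sin(theta)**2 / (4.0 * math.pi * c**3 * r**2)
        band_area = 2.0 * math.pi * r**2 * math.sin(theta) * d_theta
        total += S * band_area
    return total


P_numeric = radiated_power(3.0)
P_formula = 2.0 * 3.0**2 / 3.0          # (80.8) with c = 1
print(P_numeric, P_formula)
```

The result does not depend on the radius of the sphere, since the $1/r^2$ fall-off of the flux is compensated by the growth of the area.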


81. Magnetic Dipole and Quadrupole Radiations 

If the properties of a system of charges are such that $\ddot{\mathbf{p}} = 0$, no
dipole radiation is produced. This does not signify, however, that
there is no radiation at all. In this case, we must take into account 
the terms of the expansion of the potentials which we disregarded in 
the dipole approximation. 

We established in the preceding section that in the wave zone a 
wave in small regions is close to a plane wave, for which relation 
(80.5) holds. This relation makes it quite simple to find E if we know 
B. To find B, on the other hand, it is sufficient to know only A. 
We shall therefore limit ourselves to finding the vector potential. 

Consider the second term of formula (79.16) which we disregarded
in the dipole approximation. Since in the case we are interested in
the first term in (79.16) is zero, the vector potential is

$$\mathbf{A}(\mathbf{r}, t) = -\frac{1}{c}\frac{\partial}{\partial r}\left\{ \frac{1}{r}\int \mathbf{j}\left(\mathbf{r}',\ t - \frac{r}{c}\right)(\mathbf{n}\mathbf{r}')\,dV' \right\}$$

Differentiation with respect to $r$ yields two terms. One of them is
proportional to $1/r^2$, and the second is proportional to $1/r$. The first term
diminishes with the increasing distance much more rapidly than the
second one. We therefore disregard it, as we have already done many
times. In addition, we take into account that $\partial\mathbf{j}/\partial r = -(1/c)\,\partial\mathbf{j}/\partial t$.
The result is

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{c^2 r}\frac{\partial}{\partial t}\int \mathbf{j}\left(\mathbf{r}',\ t - \frac{r}{c}\right)(\mathbf{n}\mathbf{r}')\,dV'$$

To simplify our further calculations, let us pass over from a
continuous distribution of the charges to a discrete one. The expression
for $\mathbf{A}$ therefore becomes

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{c^2 r}\frac{\partial}{\partial t}\sum_a e_a\mathbf{v}_a(\mathbf{n}\mathbf{r}'_a) \tag{81.1}$$

The values of $\mathbf{v}_a$ and $\mathbf{r}'_a$ are taken for the instant $t - r/c$.

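The expression $\mathbf{v}_a(\mathbf{n}\mathbf{r}'_a)$ can be written as follows:

$$\mathbf{v}_a(\mathbf{n}\mathbf{r}'_a) = \frac{\partial}{\partial t}\{\mathbf{r}'_a(\mathbf{n}\mathbf{r}'_a)\} - \mathbf{r}'_a(\mathbf{n}\mathbf{v}_a) \tag{81.2}$$

Let us divide $\mathbf{v}_a(\mathbf{n}\mathbf{r}'_a)$ into two equal parts and replace one of them
with half of expression (81.2):

$$\mathbf{v}_a(\mathbf{n}\mathbf{r}'_a) = \frac{1}{2}\mathbf{v}_a(\mathbf{n}\mathbf{r}'_a) + \frac{1}{2}\frac{\partial}{\partial t}\{\mathbf{r}'_a(\mathbf{n}\mathbf{r}'_a)\} - \frac{1}{2}\mathbf{r}'_a(\mathbf{n}\mathbf{v}_a)$$

The first and third terms can be written as a vector triple
product: $\frac{1}{2}[\mathbf{n}, [\mathbf{v}_a, \mathbf{r}'_a]]$; this can be verified with the aid of formula
(VI.5). Hence,

$$\mathbf{v}_a(\mathbf{n}\mathbf{r}'_a) = \frac{1}{2}[\mathbf{n}, [\mathbf{v}_a\mathbf{r}'_a]] + \frac{1}{2}\frac{\partial}{\partial t}\{\mathbf{r}'_a(\mathbf{n}\mathbf{r}'_a)\}$$

Substitution of this expression into formula (81.1) yields

$$\mathbf{A}(\mathbf{r}, t) = \frac{1}{2c^2 r}\frac{\partial}{\partial t}\sum_a e_a[\mathbf{n}, [\mathbf{v}_a\mathbf{r}'_a]] + \frac{1}{2c^2 r}\frac{\partial^2}{\partial t^2}\sum_a e_a\mathbf{r}'_a(\mathbf{n}\mathbf{r}'_a) = \mathbf{A}_m + \mathbf{A}_Q \tag{81.3}$$

The meaning of the subscripts $m$ and $Q$ will be revealed below.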

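Let us put $\mathbf{n}$ outside the sum sign in the first term and exchange
the places of $\mathbf{v}_a$ and $\mathbf{r}'_a$. As a result, this term becomes

$$\frac{1}{2c^2 r}\frac{\partial}{\partial t}\left[ \mathbf{n},\ \sum_a e_a[\mathbf{v}_a\mathbf{r}'_a] \right] = \frac{1}{cr}\frac{\partial}{\partial t}\left[ \frac{1}{2c}\sum_a e_a[\mathbf{r}'_a\mathbf{v}_a],\ \mathbf{n} \right]$$

The first factor is the magnetic moment $\mathbf{m}$ of the system [see
formula (51.15)]. We can therefore write that

$$\mathbf{A}_m = \frac{1}{cr}\,[\dot{\mathbf{m}}\mathbf{n}] \tag{81.4}$$

where $\dot{\mathbf{m}} = d\mathbf{m}/dt$ is taken for the instant $t - r/c$. The potential
$\mathbf{A}_m$ is thus determined by the changes in the magnetic dipole moment.
Therefore, the relevant radiation is called magnetic-dipole radiation.
For magnetic-dipole radiation, the magnetic field is

$$\mathbf{B}_m = [\nabla\mathbf{A}_m] = \left[ \nabla r,\ \frac{\partial\mathbf{A}_m}{\partial r} \right] = \left[ \mathbf{n},\ \frac{\partial\mathbf{A}_m}{\partial r} \right]$$

[compare with (80.3)]. If we disregard in $\partial\mathbf{A}_m/\partial r$ the term
proportional to $1/r^2$ and replace $\partial/\partial r$ with $(-1/c)\,\partial/\partial t$ [recall that
$\mathbf{m} = \mathbf{m}(t - r/c)$], we obtain

$$\mathbf{B}_m = -\frac{1}{c^2 r}\,[\mathbf{n}, [\ddot{\mathbf{m}}\mathbf{n}]] = \frac{1}{c^2 r}\,[[\ddot{\mathbf{m}}\mathbf{n}], \mathbf{n}]$$

Using relation (72.16) equivalent to (80.5), we find that

$$\mathbf{E}_m = -\frac{1}{c^2 r}\,[\ddot{\mathbf{m}}\mathbf{n}] = \frac{1}{c^2 r}\,[\mathbf{n}\ddot{\mathbf{m}}]$$

Hence, the fields in magnetic-dipole radiation are

$$\mathbf{E}_m = \frac{1}{c^2 r}\,[\mathbf{n}\ddot{\mathbf{m}}], \qquad \mathbf{B}_m = \frac{1}{c^2 r}\,[[\ddot{\mathbf{m}}\mathbf{n}], \mathbf{n}] \tag{81.5}$$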


A comparison of the results obtained with formulas (80.4) for dipole
radiation shows that $\mathbf{B}_m$ is expressed in terms of $\ddot{\mathbf{m}}$ by a formula
similar to the one expressing $\mathbf{E}$ in terms of $\ddot{\mathbf{p}}$. The formulas for $\mathbf{E}_m$ and $\mathbf{B}$
differ, apart from the substitution of $\ddot{\mathbf{m}}$ for $\ddot{\mathbf{p}}$, in the sign. It thus
follows that a pattern of the field in magnetic-dipole radiation in the
wave zone can be obtained by substituting $\ddot{\mathbf{m}}$ for $\ddot{\mathbf{p}}$, $\mathbf{B}_m$ for $\mathbf{E}$, and $-\mathbf{E}_m$
for $\mathbf{B}$ in Fig. 80.1.

Let us turn to the second term of formula (81.3), i.e. to the
expression

$$\mathbf{A}_Q = \frac{1}{2c^2 r}\frac{\partial^2}{\partial t^2}\sum_a e_a\mathbf{r}'_a(\mathbf{n}\mathbf{r}'_a) \tag{81.6}$$

The addition to $\mathbf{A}_Q$ of the expression $f(r)\,\mathbf{n}$ [here $f(r)$ is any function
of $r$] will not change $\mathbf{B}_Q$ because $[\nabla, f(r)\,\mathbf{n}] = 0$ [see (XI.54)].
Let us take as $f(r)$ the function

$$f(r) = -\frac{1}{6c^2 r}\frac{\partial^2}{\partial t^2}\sum_a e_a r'^2_a$$

[$r'_a$ is a function of the argument $(t - r/c)$, which when acted upon
by the operator $\nabla$ behaves like a function of $r$]. Multiplication of
this function by $\mathbf{n}$ and the addition of it to (81.6) yields

$$\mathbf{A}_Q = \frac{1}{6c^2 r}\frac{\partial^2}{\partial t^2}\sum_a e_a\{3\mathbf{r}'_a(\mathbf{n}\mathbf{r}'_a) - r'^2_a\mathbf{n}\}$$

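Let us write an expression for the $i$-th component of the vector $\mathbf{A}_Q$:

$$A_{Qi} = \frac{1}{6c^2 r}\frac{\partial^2}{\partial t^2}\sum_a e_a\left\{ 3x'_{ai}\left( \sum_k n_k x'_{ak} \right) - r'^2_a n_i \right\}$$

The last term in the braces can be written as $\sum_k r'^2_a\delta_{ik}n_k$. Hence, $A_{Qi}$
will appear as follows:

$$A_{Qi} = \frac{1}{6c^2 r}\frac{\partial^2}{\partial t^2}\sum_a e_a\left\{ 3x'_{ai}\left( \sum_k n_k x'_{ak} \right) - \left( \sum_k r'^2_a\delta_{ik}n_k \right) \right\} = \frac{1}{6c^2 r}\frac{\partial^2}{\partial t^2}\sum_k n_k\sum_a e_a\{3x'_{ai}x'_{ak} - r'^2_a\delta_{ik}\}$$

But the sum over $a$ is $Q_{ik}$, a component of the tensor of the
quadrupole moment of the system [see formula (43.13)]. Consequently

$$A_{Qi} = \frac{1}{6c^2 r}\frac{\partial^2}{\partial t^2}\sum_k Q_{ik}n_k = \frac{1}{6c^2 r}\frac{\partial^2}{\partial t^2}Q_i$$

where $Q_i = \sum_k Q_{ik}n_k$ is a component of the vector obtained when the vector $\mathbf{n}$
is multiplied by the tensor $Q_{ik}$. Denoting this vector by the symbol
$\mathbf{Q}$, we can write

$$\mathbf{A}_Q = \frac{1}{6c^2 r}\,\ddot{\mathbf{Q}} \tag{81.7}$$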

The potential $\mathbf{A}_Q$ is determined by the changes in the quadrupole
moment of the system. For this reason, the corresponding radiation
is called quadrupole radiation.

The magnetic field of quadrupole radiation is

$$\mathbf{B}_Q = [\nabla\mathbf{A}_Q] = \left[ \nabla r,\ \frac{\partial\mathbf{A}_Q}{\partial r} \right] = \left[ \mathbf{n},\ \frac{\partial\mathbf{A}_Q}{\partial r} \right]$$

In calculating $\partial\mathbf{A}_Q/\partial r$, we disregard the term proportional to $1/r^2$
and replace the derivative with respect to $r$ with one with respect to
$t$. The result is

$$\mathbf{B}_Q = \frac{1}{6c^3 r}\,[\dddot{\mathbf{Q}}\mathbf{n}] \tag{81.8}$$

(we have transposed the factors to avoid writing the minus sign
introduced by the factor $-1/c$).

Taking advantage of relation (80.5), we find that

$$\mathbf{E}_Q = \frac{1}{6c^3 r}\,[[\dddot{\mathbf{Q}}\mathbf{n}], \mathbf{n}] \tag{81.9}$$

In comparison with formulas (80.4), expressions (81.8) and (81.9)
have the additional factor $1/6c$. In addition, they contain $\dddot{\mathbf{Q}}$ instead
of $\ddot{\mathbf{p}}$. Otherwise, the formulas for the fields in quadrupole radiation
are identical to the relevant formulas for dipole radiation.

Calculations show that the total power of all three kinds of
radiation (including dipole radiation) is determined by the expression

$$P = \frac{2}{3c^3}\,\ddot{\mathbf{p}}^2 + \frac{2}{3c^3}\,\ddot{\mathbf{m}}^2 + \frac{1}{180c^5}\sum_{i,k}\dddot{Q}_{ik}^2 \tag{81.10}$$

Let us assess the relative intensity of various kinds of radiation.
We shall assume for simplicity that the charges of the system move
according to a harmonic law. Hence

$$r' = l\cos\omega t, \qquad v = l\omega\sin\omega t$$

where $l$ is a quantity of the order of the system's dimensions. The
average values of the magnitudes of $\cos\omega t$ and $\sin\omega t$ are quantities
of the order of unity. Therefore in the final expressions characterizing
the order of magnitude of the expressions being considered, we shall
discard the factors $\cos\omega t$ and $\sin\omega t$. For instance, if we do not have
in mind the necessity of time differentiation, we can write that

$$r' \sim l, \qquad v \sim l\omega \tag{81.11}$$

It can be seen from definition (43.5) that $p$ is a quantity of the
order of $er'$, i.e.

$$p \sim el\cos\omega t, \qquad \dot{p} \sim el\omega\sin\omega t, \qquad \ddot{p} \sim el\omega^2\cos\omega t$$

Hence we find that with respect to the order of magnitude, the power
of dipole radiation is determined by the expression

$$P_p \sim \frac{\ddot{p}^2}{c^3} \sim \frac{e^2 l^2\omega^4}{c^3} \sim \frac{e^2 v^2\omega^2}{c^3} \tag{81.12}$$

[see (81.11)].

It follows from definition (51.15) that the magnetic moment has
a magnitude of the order of $er'v/c$, i.e.

$$m \sim \frac{1}{c}\,el^2\omega\cos\omega t\sin\omega t \sim \frac{1}{c}\,el^2\omega\sin 2\omega t$$

$$\dot{m} \sim \frac{1}{c}\,el^2\omega^2\cos 2\omega t, \qquad \ddot{m} \sim \frac{1}{c}\,el^2\omega^3\sin 2\omega t$$

whence

$$P_m \sim \frac{\ddot{m}^2}{c^3} \sim \frac{e^2 l^4\omega^6}{c^5} \sim \frac{e^2 v^4\omega^2}{c^5} \tag{81.13}$$

A comparison of expressions (81.12) and (81.13) shows that

$$\frac{P_m}{P_p} \sim \frac{v^2}{c^2}$$

Recall that we have calculated the fields of radiated waves for the
non-relativistic case (i.e. for $v \ll c$). Consequently, in the case we
have studied, the intensity of magnetic-dipole radiation is much
lower than that of electric-dipole radiation.

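According to definition (43.13), the quadrupole moment (and,
therefore, the vector $\mathbf{Q}$) is a quantity of the order of $er'^2$, i.e.

$$Q \sim el^2\cos^2\omega t$$

$$\dot{Q} \sim el^2\omega\cos\omega t\sin\omega t \sim el^2\omega\sin 2\omega t$$

$$\ddot{Q} \sim el^2\omega^2\cos 2\omega t, \qquad \dddot{Q} \sim el^2\omega^3\sin 2\omega t$$

Hence,

$$P_Q \sim \frac{\dddot{Q}^2}{c^5} \sim \frac{e^2 l^4\omega^6}{c^5} \sim \frac{e^2 v^4\omega^2}{c^5} \tag{81.14}$$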


Quadrupole radiation thus has an intensity of the same order of 
magnitude as magnetic-dipole radiation. 
I. Lagrange's Equations for a Holonomic System
with Ideal Non-Stationary Constraints


For non-stationary constraints, conditions (4.2) have the form

$$f_l(x_1, x_2, \ldots, x_n, t) = 0 \qquad (l = 1, 2, \ldots, r) \tag{I.1}$$

Accordingly, the time also enters functions (4.3):

$$x_i = x_i(q_1, q_2, \ldots, q_s, t) \qquad (i = 1, 2, \ldots, n) \tag{I.2}$$

Another term appears in formulas (4.4) and (4.5):

$$\dot{x}_i = \frac{\partial x_i}{\partial t} + \sum_l \frac{\partial x_i}{\partial q_l}\,\dot{q}_l \tag{I.3}$$

From (I.3), we obtain the relation

$$\frac{\partial\dot{x}_i}{\partial\dot{q}_k} = \frac{\partial x_i}{\partial q_k} \tag{I.4}$$

coinciding with (4.6). Formulas (4.7) also remain unchanged:

$$\frac{\partial x_i}{\partial\dot{q}_k} = 0 \tag{I.5}$$


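Since the quantities $\partial x_i/\partial q_k$ contain not only $q_l$, but also $t$,
expression (4.8) becomes somewhat more involved:

$$\frac{d}{dt}\frac{\partial x_i}{\partial q_k} = \frac{\partial}{\partial t}\left( \frac{\partial x_i}{\partial q_k} \right) + \sum_l \frac{\partial}{\partial q_l}\left( \frac{\partial x_i}{\partial q_k} \right)\dot{q}_l = \frac{\partial^2 x_i}{\partial t\,\partial q_k} + \sum_l \frac{\partial^2 x_i}{\partial q_l\,\partial q_k}\,\dot{q}_l \tag{I.6}$$

Differentiation of expression (I.3) with respect to $q_k$ yields

$$\frac{\partial\dot{x}_i}{\partial q_k} = \frac{\partial^2 x_i}{\partial q_k\,\partial t} + \sum_l \frac{\partial^2 x_i}{\partial q_k\,\partial q_l}\,\dot{q}_l$$

Comparing this expression with (I.6), we arrive at the relation

$$\frac{\partial\dot{x}_i}{\partial q_k} = \frac{d}{dt}\frac{\partial x_i}{\partial q_k} \tag{I.7}$$

coinciding with (4.9).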
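Let us multiply the equations

$$\frac{d}{dt}\frac{\partial L}{\partial\dot{x}_i} - \frac{\partial L}{\partial x_i} = R_i + F^*_i \qquad (i = 1, 2, \ldots, n)$$

[see (4.1)] by $\partial x_i/\partial q_k$ and summate them over $i$:

$$\sum_i \left( \frac{d}{dt}\frac{\partial L}{\partial\dot{x}_i} \right)\frac{\partial x_i}{\partial q_k} - \sum_i \frac{\partial L}{\partial x_i}\frac{\partial x_i}{\partial q_k} = \sum_i R_i\frac{\partial x_i}{\partial q_k} + \sum_i F^*_i\frac{\partial x_i}{\partial q_k} \tag{I.8}$$

Since relations (4.9), (4.6), and (4.7), used in Sec. 4 in the
transformation of the left-hand side of this equation, remained unchanged
[see (I.7), (I.4) and (I.5)], the result will also remain unchanged.
Consequently, the left-hand side of (I.8) can be written as

$$\frac{d}{dt}\frac{\partial L}{\partial\dot{q}_k} - \frac{\partial L}{\partial q_k}$$

[see the paragraph preceding formula (4.15)].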

Before starting to consider the right-hand side of formula (I.8),
let us discuss the following matter. With stationary constraints,
$x_i = x_i(q_k)$, and the increment of the coordinate $x_i$ during the time
$dt$ will be

$$dx_i = \sum_k \frac{\partial x_i}{\partial q_k}\,dq_k \tag{I.9}$$

where $dq_k$ are the increments of the generalized coordinates during
the time $dt$. With non-stationary constraints, $x_i = x_i(q_k, t)$, and
the increment of $x_i$ during the time $dt$ is

$$dx_i = \frac{\partial x_i}{\partial t}\,dt + \sum_k \frac{\partial x_i}{\partial q_k}\,dq_k \tag{I.10}$$

where $dq_k$ are again the increments of the coordinates $q_k$ during the
time $dt$.

If the constraints were to suddenly stop changing, $\partial x_i/\partial t$ would
vanish, and formula (I.10) would coincide with (I.9). Consequently,
the second term in (I.10) is the imaginary displacement of the system
that would be obtained with "frozen" constraints. It is called the
virtual (or possible) displacement and is designated by $\delta x_i$. Hence,
the true displacement $dx_i$ can be written as the sum of the virtual
displacement $\delta x_i$ and an addend equal to $(\partial x_i/\partial t)\,dt$:

$$dx_i = \delta x_i + \frac{\partial x_i}{\partial t}\,dt \tag{I.11}$$

With stationary constraints, the virtual displacement coincides with
the true one.
Now let us consider the first term on the right-hand side of
formula (I.8). Multiplying it by $dq_k$, we obtain

$$\sum_i R_i \frac{\partial x_i}{\partial q_k}\,dq_k = \sum_i R_i\,\delta x_i \tag{I.12}$$

where $\delta x_i$ is the virtual increment of the coordinate $x_i$ appearing
when only one generalized coordinate $q_k$ changes.

The work of the reactions $R_i$ with fixed non-stationary constraints
(as with stationary constraints) is zero. Therefore, $\sum_i R_i\,\delta x_i = 0$,
and since $dq_k \neq 0$, it follows from (I.12) that

$$\sum_i R_i \frac{\partial x_i}{\partial q_k} = 0 \tag{I.13}$$

We must note that the true work of the reactions of non-stationary
constraints according to (I.11) and (I.13) is

$$\sum_i R_i\,dx_i = \sum_i R_i\,\delta x_i + \sum_i R_i \frac{\partial x_i}{\partial t}\,dt = \sum_i R_i \frac{\partial x_i}{\partial t}\,dt$$

and, generally speaking, is non-zero.
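
The distinction between virtual and true work can be illustrated on a simple system that is not taken from the text: a bead of mass m sliding without friction on a straight wire that rotates in a plane with a constant angular velocity omega (a hypothetical example; all names and numbers below are assumptions). Here x = q cos(omega t), y = q sin(omega t), and the motion obeys q'' = omega^2 q. The sketch checks that the reaction does no work on a virtual displacement, whereas its true power is non-zero and equals the rate of change of the kinetic energy.

```python
import math

m, omega, q0 = 1.0, 2.0, 0.5  # illustrative mass, angular velocity, initial distance

def q(t):
    return q0 * math.cosh(omega * t)          # solves q'' = omega^2 q with q'(0) = 0

def q_dot(t):
    return q0 * omega * math.sinh(omega * t)

def reaction(t):
    """Reaction of the wire: normal to the wire, magnitude 2 m q' omega."""
    n_mag = 2.0 * m * q_dot(t) * omega
    return (-n_mag * math.sin(omega * t), n_mag * math.cos(omega * t))

def dx_dq(t):
    """Partial derivatives of (x, y) with respect to q."""
    return (math.cos(omega * t), math.sin(omega * t))

def dx_dt(t):
    """Partial derivatives of (x, y) with respect to t at fixed q."""
    return (-q(t) * omega * math.sin(omega * t), q(t) * omega * math.cos(omega * t))

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1]

def kinetic_energy(t):
    return 0.5 * m * (q_dot(t) ** 2 + (q(t) * omega) ** 2)

t = 0.3
virtual_term = dot(reaction(t), dx_dq(t))   # sum_i R_i dx_i/dq, expected to vanish
true_power = dot(reaction(t), dx_dt(t))     # sum_i R_i dx_i/dt, true work per unit time
h = 1e-5
energy_rate = (kinetic_energy(t + h) - kinetic_energy(t - h)) / (2.0 * h)
print(virtual_term, true_power, energy_rate)
```

The first printed number should be zero to rounding accuracy, and the last two should agree to within the finite-difference error.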

Hence, according to (I.13), the first term on the right-hand side
of formula (I.8) is zero; the reactions of the constraints have again
vanished from the equations. As regards the second term, by
definition it is the generalized force $Q_k^*$ [see formula (4.12)]. Consequently,
for non-stationary constraints too, we have arrived at Lagrange's
equations

$$\frac{d}{dt}\frac{\partial L}{\partial \dot{q}_k} - \frac{\partial L}{\partial q_k} = Q_k^* \quad (k = 1, 2, \ldots, s)$$

II. Euler's Theorem for Homogeneous Functions 

A function of any number of variables is called a homogeneous
function of these variables of the degree $m$ if, upon multiplying all
the variables by the arbitrary quantity $\alpha$, the function is multiplied
by $\alpha^m$, i.e.

$$f(\alpha x_1, \alpha x_2, \ldots, \alpha x_n) = \alpha^m f(x_1, x_2, \ldots, x_n)$$

Differentiation of this identity with respect to $\alpha$ yields

$$\sum_i \frac{\partial f}{\partial(\alpha x_i)}\,x_i = m\alpha^{m-1} f(x_1, x_2, \ldots, x_n)$$

Assuming that $\alpha = 1$, we obtain

$$\sum_i \frac{\partial f}{\partial x_i}\,x_i = m f(x_1, x_2, \ldots, x_n)$$

We have proved Euler's theorem, which states that the sum of
the products of partial derivatives of a homogeneous function and the
corresponding variables equals the product of the function itself and the
degree of its homogeneity.
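
The theorem is easy to check numerically. The following minimal sketch uses an arbitrarily chosen homogeneous function of degree 3 (an assumption made purely for illustration) and estimates the partial derivatives by central differences.

```python
# Numerical check of Euler's theorem for a homogeneous function.
# The function f below is an arbitrary illustration, not taken from the text.

def f(x1, x2, x3):
    """Homogeneous function of degree m = 3 in (x1, x2, x3)."""
    return x1**3 + 2.0 * x1 * x2 * x3 + x2**2 * x3

def partial(func, point, i, h=1e-6):
    """Central-difference estimate of the partial derivative with respect to point[i]."""
    plus = list(point)
    minus = list(point)
    plus[i] += h
    minus[i] -= h
    return (func(*plus) - func(*minus)) / (2.0 * h)

def euler_sum(func, point):
    """Sum over i of (df/dx_i) * x_i, the left-hand side of Euler's theorem."""
    return sum(partial(func, point, i) * point[i] for i in range(len(point)))

m = 3
point = (1.2, -0.7, 2.5)
lhs = euler_sum(f, point)
rhs = m * f(*point)
print(lhs, rhs)  # the two numbers should agree to within the finite-difference error
```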


III. Some Information from the Calculus
of Variations 

1. Functional. If to each number $x$ belonging to a certain class
there corresponds another number $y$, it is general knowledge that
we have to do with the function $y = y(x)$.

If to each function $y(x)$ belonging to a certain class of functions
there corresponds a certain number $\Phi$, the functional $\Phi[y(x)]$ is
said to be set.

For purposes of clarity, we shall sometimes speak of a curve instead
of a function.

Hence, a function establishes the correspondence:

number → number

whereas a functional establishes the correspondence:

function (or curve) → number

Consequently, when dealing with a functional, the role of the
argument is played by a function (or a curve).
Let us explain what has been said above by means of the following
example. Assume that we are given two fixed points 1 and 2 in the
plane $x, y$ (Fig. III.1). The distance $l_{12}$ between the points measured
along the curve joining them is a functional. To find an analytical
expression relating the quantity $l_{12}$ to the function $y = y(x)$
describing the curve, we shall take into account that the element $dl$
of the curve is related to $dx$ and $dy$ by the expression $dl^2 = dx^2 + dy^2$.
Writing $dy$ as $y'(x)\,dx$, we obtain
$dl = \sqrt{dx^2 + [y'(x)]^2\,dx^2} = \sqrt{1 + [y'(x)]^2}\,dx$.
Finally, integrating and designating $l_{12}$ by $\Phi[y(x)]$, we arrive at
the expression

$$\Phi[y(x)] = \int_1^2 \sqrt{1 + [y'(x)]^2}\,dx \tag{III.1}$$

Taking different curves, i.e. different functions $y(x)$, we shall
obtain different numbers $\Phi$.
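
As a rough illustration of how functional (III.1) assigns a number to each curve, the sketch below approximates the length of three curves joining the same two points by summing short chords. The curves, the end points (0, 0) and (1, 1), and the helper name `arc_length` are assumptions chosen for the example.

```python
import math

def arc_length(y, x1, x2, n=20000):
    """Approximate Phi[y] = integral of sqrt(1 + y'(x)^2) dx by summing chord lengths."""
    h = (x2 - x1) / n
    total = 0.0
    for i in range(n):
        xa = x1 + i * h
        xb = xa + h
        total += math.hypot(h, y(xb) - y(xa))
    return total

# Three illustrative curves, all joining the points (0, 0) and (1, 1).
curves = {
    "straight line": lambda x: x,
    "parabola": lambda x: x * x,
    "sine arc": lambda x: math.sin(0.5 * math.pi * x),
}

lengths = {name: arc_length(y, 0.0, 1.0) for name, y in curves.items()}
for name, value in lengths.items():
    print(name, value)
```

Each curve yields its own value of the functional, and the straight line gives the smallest of the three.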
In the same way as there are functions not of one, but of several
variables, there are functionals depending on several functions¹:

$$\Phi[y_1(x), y_2(x), \ldots, y_s(x)]$$

If a functional satisfies the conditions

$$\Phi[c\,y(x)] = c\,\Phi[y(x)] \quad (c = \text{constant})$$
$$\Phi[y_1(x) + y_2(x)] = \Phi[y_1(x)] + \Phi[y_2(x)] \tag{III.2}$$

it is called linear. We shall denote a linear functional by the
symbol $\Phi_{\text{lin}}$, or $F_{\text{lin}}$, etc.

Functionals depending on several functions may be linear with
respect to some of them and non-linear with respect to the others.
In this case, for example, the symbol

$$F_{\text{lin},\,y_2}[y_1(x), y_2(x), \ldots]$$

will stand for a functional that is linear relative to the function $y_2(x)$.

It is the task of the calculus of variations to work out methods for
finding the extremal (i.e. maximum, minimum, or stationary) values
of functionals. This task is in many aspects similar to the task
of finding the extrema of conventional functions.

2. Variation of a Functional. Let us select an arbitrary function
$\tilde{y}(x)$ from a class of functions being considered (we have used the
tilde sign to distinguish the selected function from the remaining
functions of the given class). Now let us select another function $y(x)$
from the same class. The difference of these two functions is called
the variation of the function $y(x)$. The variation is designated by
the symbol $\delta y(x)$ or simply $\delta y$. The variation of a function is thus
determined by the following expression:

$$\delta y = y(x) - \tilde{y}(x) \tag{III.3}$$

The variation of a function is similar to the increment $\Delta x$ (or $dx$)
of the argument of an ordinary function: $\Delta x = x - \tilde{x}$.

The variation $\delta y$ of the function $y(x)$ is evidently a function of $x$.
Differentiating this function with respect to $x$, we find in accordance
with (III.3) that

$$(\delta y)' = y'(x) - \tilde{y}'(x)$$

The right-hand side of this expression is the variation of the function
$y'(x)$. Consequently, we arrive at the relation

$$(\delta y)' = \delta y' \tag{III.4}$$

(the derivative of a variation is the variation of the derivative).


¹ There are also functionals depending on the functions of several variables
$x_1, x_2, \ldots, x_n$. We shall not need such functionals, however, and shall not
consider them.
Let us determine what quantity in the calculus of variations
corresponds to the differential of a conventional function. Recall that
if a function is continuous¹, its increment $\Delta y$ equal to $y(x) - y(\tilde{x})$
can be written as the sum of a term that is linear with respect to $\Delta x$
and an infinitesimal of an order higher than the first one relative
to $\Delta x$:

$$\Delta y = y'(\tilde{x})\,\Delta x + \varepsilon\,\Delta x$$

where $\varepsilon$ is a quantity that vanishes together with $\Delta x$ (in other words,
$\lim_{\Delta x \to 0} \varepsilon = 0$). The first term in this expression is called the
differential of the function. Hence, a differential of a function is defined to
be that part of the increment of the function that is linear relative
to $\Delta x$.

For the function $y = x$, the differential coincides with the
increment. Consequently, $\Delta x = dx$, so that the expression for the
differential of a function can be written as

$$dy = y'(x)\,\Delta x \quad \text{or} \quad dy = y'(x)\,dx \tag{III.5}$$

If a small change in a functional corresponds to a small change
in a function (to a small $\delta y$), the functional is called continuous.
A quantity similar to the differential of a conventional function can
be introduced for continuous functionals.

The increment of a functional

$$\Delta\Phi = \Phi[y(x) + \delta y] - \Phi[y(x)]$$

is a quantity depending on two functions: $y(x)$ and $\delta y$. Therefore,
$\Delta\Phi$ is also a functional. This functional, generally speaking, will
be non-linear. If $\Delta\Phi[y(x)]$ can be written as the sum of the
functional $F_{\text{lin},\,\delta y}[y(x), \delta y]$ that is linear relative to $\delta y$ and an
infinitesimal of an order higher than the first one with respect to $|\delta y|_{\max}$
(the maximum value of the magnitude of the function $\delta y$), the main
part of the increment of a functional linear relative to $\delta y$ is called
the variation of the functional. Hence,

$$\Delta\Phi[y(x)] = \delta\Phi[y(x)] + \varepsilon\,|\delta y|_{\max} \tag{III.6}$$

where

$$\delta\Phi[y(x)] = F_{\text{lin},\,\delta y}[y(x), \delta y] \tag{III.7}$$

is the variation of the functional, and $\varepsilon$ is a quantity vanishing
together with $|\delta y|_{\max}$.

The variation of a functional is the analogue of the differential
of a function determined by expression (III.5).


¹ More strictly, the function ought to be not only continuous, but also
differentiable. As a rule, however, the functions considered in physics are also
differentiable if they are continuous, although continuous functions that are
not differentiable are known in mathematics.
The definition (III.7) can easily be generalized for functionals
depending on several functions:

$$\delta\Phi[y_1(x), y_2(x), \ldots, y_s(x)] = F_{\text{lin},\,\delta y_1, \delta y_2, \ldots, \delta y_s}[y_1(x), y_2(x), \ldots, y_s(x), \delta y_1, \delta y_2, \ldots, \delta y_s] \tag{III.8}$$

As an illustration, let us find the increment of the functional

$$\Phi[y(x)] = \int_a^b [y(x)]^2\,dx \tag{III.9}$$

This increment is

$$\Delta\Phi = \Phi[y(x) + \delta y] - \Phi[y(x)] = \int_a^b [y(x) + \delta y]^2\,dx - \int_a^b [y(x)]^2\,dx$$
$$= \int_a^b [2y(x)\,\delta y + (\delta y)^2]\,dx = \int_a^b 2y(x)\,\delta y\,dx + \int_a^b (\delta y)^2\,dx$$

The integral $\int_a^b (\delta y)^2\,dx$ does not exceed the quantity
$|\delta y|_{\max}^2 (b - a) = \{|\delta y|_{\max}(b - a)\}\,|\delta y|_{\max}$, where $|\delta y|_{\max}$ is the
maximum value of the magnitude of the function $\delta y$ within the interval
$a \le x \le b$. The expression in braces vanishes together with $|\delta y|_{\max}$.
Consequently, $\int_a^b (\delta y)^2\,dx$ can be written as $\varepsilon\,|\delta y|_{\max}$, where $\varepsilon \to 0$
when $|\delta y|_{\max} \to 0$. Hence,

$$\Delta\Phi = \int_a^b 2y(x)\,\delta y\,dx + \varepsilon\,|\delta y|_{\max}$$

[compare with (III.6)]. The first term in this expression is the
functional depending on the functions $y(x)$ and $\delta y$:

$$F_{\text{lin},\,\delta y}[y(x), \delta y] = \int_a^b 2y(x)\,\delta y\,dx \tag{III.10}$$

It is easy to verify that this functional is linear with respect to $\delta y$¹
[see the conditions (III.2)]. Hence, expression (III.10) gives the
variation of the functional (III.9).

¹ The given functional is also linear with respect to $y(x)$, but this is of no
significance: only the linearity with respect to $\delta y$ is important.
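
The splitting of the increment into a linear part and a higher-order remainder can be checked numerically. In the sketch below the base function exp(x), the shape sin(pi x) of the variation, and the interval [0, 1] are assumptions chosen for illustration; the remainder divided by the amplitude alpha of the variation plays the role of the quantity epsilon.

```python
import math

A, B, N = 0.0, 1.0, 2000  # integration limits and number of midpoint cells (arbitrary choices)

def integrate(g):
    """Midpoint-rule approximation of the integral of g(x) from A to B."""
    h = (B - A) / N
    return h * sum(g(A + (i + 0.5) * h) for i in range(N))

def y(x):
    return math.exp(x)            # illustrative base function

def dy0(x):
    return math.sin(math.pi * x)  # illustrative shape of the variation, maximum magnitude 1

def remainder(alpha):
    """Increment of functional (III.9) minus the linear part (III.10) for the variation alpha*dy0."""
    increment = integrate(lambda x: (y(x) + alpha * dy0(x)) ** 2) - integrate(lambda x: y(x) ** 2)
    linear_part = integrate(lambda x: 2.0 * y(x) * alpha * dy0(x))
    return increment - linear_part

for alpha in (1e-1, 1e-2, 1e-3):
    # remainder / alpha plays the role of epsilon and should shrink together with alpha
    print(alpha, remainder(alpha) / alpha)
```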
3. Necessary Condition for the Extremum of a Functional. Let us
again begin with similar concepts from calculus. The function
$f(x_1, x_2, \ldots, x_n)$ is said to have an extremum at the point
$\tilde{x}_1, \tilde{x}_2, \ldots, \tilde{x}_n$ if the increment of this function

$$\Delta f = f(x_1, x_2, \ldots, x_n) - f(\tilde{x}_1, \tilde{x}_2, \ldots, \tilde{x}_n)$$

has the same sign for all the points $(x_1, x_2, \ldots, x_n)$ belonging to
the vicinity of the point $(\tilde{x}_1, \tilde{x}_2, \ldots, \tilde{x}_n)$. There is a maximum at
a given point when $\Delta f \le 0$, and a minimum when $\Delta f \ge 0$.

It is proved in calculus that a necessary condition for the existence
of an extremum at a point is the equality to zero of the differential
of the function at this point:

$$df = \sum_{k=1}^{n} \frac{\partial f}{\partial x_k}\,dx_k = 0 \tag{III.11}$$

The functional $\Phi[y(x)]$ is similarly said to reach an extremum at
$y = \tilde{y}(x)$ if the increment of the functional

$$\Delta\Phi = \Phi[y(x)] - \Phi[\tilde{y}(x)]$$

has the same sign for all the curves $y(x)$ sufficiently close to the curve
$\tilde{y}(x)$. When $\Delta\Phi \le 0$, a maximum of the functional is observed,
and when $\Delta\Phi \ge 0$, a minimum.

Let us find the condition necessary for the functional $\Phi[y(x)]$
to reach an extremum at $y = \tilde{y}(x)$. For clarity, we shall consider the
case of a maximum. If $\Phi[y(x)]$ reaches a maximum at $y = \tilde{y}(x)$,
this signifies that

$$\Phi[\tilde{y}(x) + \delta y] - \Phi[\tilde{y}(x)] \le 0 \tag{III.12}$$

for all $\delta y = \delta y(x)$ for which $|\delta y|_{\max}$ is sufficiently small. By (III.6)
and (III.7)

$$\Delta\Phi = \Phi[\tilde{y}(x) + \delta y] - \Phi[\tilde{y}(x)] = F_{\text{lin},\,\delta y}[\tilde{y}(x), \delta y] + \varepsilon\,|\delta y|_{\max} \tag{III.13}$$

Let us separate from all the possible variations $\delta y$ those that can
be represented in the form $\delta y = \alpha\,\delta y_0$, where $\delta y_0$ is a fixed sufficiently
small variation, and $\alpha$ is a varying algebraic quantity. Introducing
this variation into (III.13) and taking into account that owing
to linearity $F[\tilde{y}(x), \alpha\,\delta y_0] = \alpha F[\tilde{y}(x), \delta y_0]$, we can write

$$\Delta\Phi = \alpha F[\tilde{y}(x), \delta y_0] + \varepsilon\alpha\,|\delta y_0|_{\max}$$

In the last expression, $F[\tilde{y}(x), \delta y_0]$ is simply a number. If this
number is non-zero, at sufficiently small $\alpha$'s the sign of $\Delta\Phi$ will
be determined by that of the expression $\alpha F[\tilde{y}(x), \delta y_0]$ (the term
$\varepsilon\alpha\,|\delta y_0|_{\max}$ diminishes much more rapidly than the first term¹),
and this expression will change its sign together with $\alpha$ (which may
be either positive or negative). Hence, for the condition (III.12)
to be satisfied, the quantity $F[\tilde{y}(x), \delta y_0]$ must equal zero. The
condition (III.12) must be satisfied for all sufficiently small variations
$\delta y$ without any exception. We have found, however, that if $\delta\Phi \neq 0$
for even part of the variations having the form $\alpha\,\delta y_0$, the above
condition is not observed. We can therefore state the following: for the
functional $\Phi[y(x)]$ to achieve a maximum at $y = \tilde{y}(x)$, its variation
(if it exists) must vanish at $y = \tilde{y}(x)$:

$$\delta\Phi = F_{\text{lin},\,\delta y}[\tilde{y}(x), \delta y] \equiv 0 \tag{III.14}$$

[The identity sign underlines the circumstance that the condition
(III.14) must be satisfied for all the $\delta y$'s.]

It can be seen that by repeating our reasoning for the minimum
of a functional, we shall arrive at the same conclusion. Consequently,
formula (III.14) expresses the condition necessary not only for a
maximum, but also for a minimum, i.e. for an extremum in general.
This formula is an analogue of formula (III.11).

4. A Simple Problem in the Calculus of Variations. Let us find the
extremum of a functional having the form

$$\Phi[y(x)] = \int_{x_1}^{x_2} f[x, y(x), y'(x)]\,dx \tag{III.15}$$

The boundary points of the allowable curves are assumed to be fixed:
for all the allowable curves

$$y(x_1) = y_1 \quad \text{and} \quad y(x_2) = y_2 \tag{III.16}$$

The increment of the functional is

$$\Delta\Phi = \int_{x_1}^{x_2} f[x, y + \delta y, y' + \delta y']\,dx - \int_{x_1}^{x_2} f[x, y, y']\,dx$$

We expand the integrand of the first integral in powers of the small
quantities $\delta y$ and $\delta y'$. The result is

$$\Delta\Phi = \int_{x_1}^{x_2} \left\{ f[x, y, y'] + \frac{\partial f}{\partial y}\,\delta y + \frac{\partial f}{\partial y'}\,\delta y' + \varepsilon(\delta y, \delta y') \right\} dx - \int_{x_1}^{x_2} f[x, y, y']\,dx$$

¹ The condition $\varepsilon \to 0$ for $|\delta y|_{\max} \to 0$ in the given case acquires the form
$\lim_{\alpha \to 0} \varepsilon = 0$.

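where $\varepsilon(\delta y, \delta y')$ combines the terms of an order higher than the
first one relative to the quantities $\delta y$ and $\delta y'$. This expression is
simplified as follows:

$$\Delta\Phi = \int_{x_1}^{x_2} \left( \frac{\partial f}{\partial y}\,\delta y + \frac{\partial f}{\partial y'}\,\delta y' \right) dx + \int_{x_1}^{x_2} \varepsilon(\delta y, \delta y')\,dx \tag{III.17}$$

Let us integrate the second term in the first integral by parts. For
this purpose, we write it as

$$J = \int_{x_1}^{x_2} \frac{\partial f}{\partial y'}\,\delta y'\,dx = \int_{x_1}^{x_2} \frac{\partial f}{\partial y'}\,(\delta y)'\,dx$$

[remember that $\delta y' = (\delta y)'$, see formula (III.4)]. Designating $\partial f/\partial y'$
by $u$, and $(\delta y)'\,dx$ by $dv$ and taking advantage of the formula
$\int u\,dv = uv - \int v\,du$, we obtain

$$J = \int_{x_1}^{x_2} \frac{\partial f}{\partial y'}\,(\delta y)'\,dx = \left. \frac{\partial f}{\partial y'}\,\delta y \right|_{x_1}^{x_2} - \int_{x_1}^{x_2} \frac{d}{dx}\left( \frac{\partial f}{\partial y'} \right) \delta y\,dx$$

Since the boundary points of the allowed curves are fixed, the
variation $\delta y$ at these points must vanish: $\delta y(x_1) = 0$, and $\delta y(x_2) = 0$.
Therefore, the first term on the right-hand side vanishes, and

$$J = -\int_{x_1}^{x_2} \frac{d}{dx}\left( \frac{\partial f}{\partial y'} \right) \delta y\,dx$$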
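Let us introduce the found value of $J$ into formula (III.17), factoring
out $\delta y$:

$$\Delta\Phi = \int_{x_1}^{x_2} \left\{ \frac{\partial f}{\partial y} - \frac{d}{dx}\frac{\partial f}{\partial y'} \right\} \delta y\,dx + \int_{x_1}^{x_2} \varepsilon(\delta y, \delta y')\,dx$$

The main part of $\Delta\Phi$ is formed by the first integral that is a
functional linear with respect to $\delta y$. By definition, this integral is the
variation of the functional $\Phi$. Hence, the variation of the functional
(III.15) is

$$\delta\Phi = \int_{x_1}^{x_2} \left\{ \frac{\partial f}{\partial y} - \frac{d}{dx}\frac{\partial f}{\partial y'} \right\} \delta y\,dx \tag{III.18}$$

and the condition (III.14) for an extremum will be written as

$$\int_{x_1}^{x_2} \left\{ \frac{\partial f}{\partial y} - \frac{d}{dx}\frac{\partial f}{\partial y'} \right\} \delta y\,dx \equiv 0$$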
The identity obtained must be observed for any sufficiently small
functions $\delta y = \delta y(x)$. This is possible only provided that the
integrand in braces vanishes:

$$\frac{\partial f}{\partial y} - \frac{d}{dx}\frac{\partial f}{\partial y'} = 0 \tag{III.19}$$

This equation is known as Euler's equation. It is the condition for
an extremum of functionals of the form given by (III.15). The curves
$y = y(x, C_1, C_2)$, which are solutions of this equation, are called
extremals ($C_1$ and $C_2$ are integration constants).

We must note that the addition to the integrand in (III.15) of
the total derivative with respect to $x$ of any function $\psi(y, x)$ does
not change the conditions of the extremum (III.19). Indeed, this
term after integration yields the quantity

$$\int_{x_1}^{x_2} \frac{d\psi}{dx}\,dx = \psi(y_2, x_2) - \psi(y_1, x_1)$$

whose variation is zero [according to the condition (III.16), the
curves do not vary at their ends]. Hence, the addition of $d\psi/dx$ changes
only the value of the extremum of a functional, but does not affect
the form of the function $y(x)$ at which this extremum is reached.

Let us use formula (III.19) to find the extremum of the functional
(III.1). In this case, $f(x, y, y') = \sqrt{1 + [y'(x)]^2}$. Hence,

$$\frac{\partial f}{\partial y} = 0, \qquad \frac{\partial f}{\partial y'} = \frac{y'(x)}{\sqrt{1 + [y'(x)]^2}}$$

and Eq. (III.19) becomes

$$-\frac{d}{dx}\frac{y'}{\sqrt{1 + y'^2}} = -\frac{y''}{\sqrt{1 + y'^2}} + \frac{y'^2\,y''}{(1 + y'^2)^{3/2}} = 0$$

A function for which $y'' = 0$ and $y' = a$ will be the solution of
this differential equation. Consequently, the function itself is a
linear one, $y = ax + b$, whose coefficients must be chosen so as to
satisfy the conditions (III.16). Therefore, functional (III.1) reaches
an extremum (in this case, evidently, a minimum) if we presume
that $y(x)$ is a straight line joining the points 1 and 2 (see Fig. III.1).
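
The same conclusion can be reached numerically without solving Euler's equation. The sketch below is an illustration under stated assumptions, not the method of the text: it replaces the curve by a broken line with fixed end points (0, 0) and (1, 1), takes an arbitrary initial shape, and lowers the discretized length (III.1) by plain gradient descent. The step size and the number of iterations are heuristic choices.

```python
import math

def discrete_length(ys, h):
    """Length of the broken line through equally spaced nodes (spacing h) with ordinates ys."""
    return sum(math.hypot(h, ys[i + 1] - ys[i]) for i in range(len(ys) - 1))

def relax(ys, h, steps=20000):
    """Gradient descent on the interior ordinates; the end ordinates stay fixed, as in (III.16)."""
    ys = list(ys)
    rate = 0.25 * h  # step size small enough for stable descent (heuristic choice)
    for _ in range(steps):
        grad = [0.0] * len(ys)
        for i in range(1, len(ys) - 1):
            left = ys[i] - ys[i - 1]
            right = ys[i + 1] - ys[i]
            # derivative of the total length with respect to the ordinate ys[i]
            grad[i] = left / math.hypot(h, left) - right / math.hypot(h, right)
        for i in range(1, len(ys) - 1):
            ys[i] -= rate * grad[i]
    return ys

n = 10
h = 1.0 / n
xs = [i * h for i in range(n + 1)]
start = [x ** 3 for x in xs]   # illustrative initial curve joining (0, 0) and (1, 1)
result = relax(start, h)
max_deviation = max(abs(y - x) for x, y in zip(xs, result))
print(discrete_length(start, h), discrete_length(result, h), max_deviation)
```

The relaxed ordinates should approach the straight line y = x, and the length should decrease toward the distance between the end points.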

5. Extremum of Functionals Depending on Several Functions.
Consider a functional of the form

$$\Phi[y_1, y_2, \ldots, y_s] = \int_{x_1}^{x_2} f[x, y_1, y_2, \ldots, y_s, y_1', y_2', \ldots, y_s']\,dx \tag{III.20}$$

where $y_k = y_k(x)$ ($k = 1, 2, \ldots, s$) are curves fixed at boundary
points, i.e. satisfying the boundary conditions

$$y_k(x_1) = y_{k1}, \quad y_k(x_2) = y_{k2} \quad (k = 1, 2, \ldots, s) \tag{III.21}$$

In accordance with what has been said in paragraph 4, the
variation of the functional (III.20) is determined by the expression

$$\delta\Phi = \int_{x_1}^{x_2} \sum_{k=1}^{s} \left( \frac{\partial f}{\partial y_k}\,\delta y_k + \frac{\partial f}{\partial y_k'}\,\delta y_k' \right) dx \tag{III.22}$$

where $\delta y_k$ are sufficiently small functions of $x$ vanishing at the
boundary points

$$\delta y_k(x_1) = 0, \quad \delta y_k(x_2) = 0 \tag{III.23}$$

It is necessary to find such a set of functions $y_k(x)$ satisfying the
conditions (III.21) at which the functional (III.20) reaches an
extremum. A necessary condition for an extremum is the vanishing of
the variation of the functional (III.22).

Integrating by parts each of the $s$ addends of the form

$$\int_{x_1}^{x_2} \frac{\partial f}{\partial y_k'}\,\delta y_k'\,dx$$

in formula (III.22), we can bring it to the form

$$-\int_{x_1}^{x_2} \frac{d}{dx}\left( \frac{\partial f}{\partial y_k'} \right) \delta y_k\,dx$$

Performing such a replacement in (III.22) and equating to zero the
expression obtained for $\delta\Phi$, we arrive at the necessary condition for
an extremum:

$$\int_{x_1}^{x_2} \sum_{k=1}^{s} \left( \frac{\partial f}{\partial y_k} - \frac{d}{dx}\frac{\partial f}{\partial y_k'} \right) \delta y_k\,dx = 0 \tag{III.24}$$

This identity must be obeyed for any sufficiently small functions
$\delta y_k = \delta y_k(x)$ selected independently of one another. This is possible
only provided that all $s$ expressions in parentheses inside the sum
are zero. We have thus arrived at a system of Euler's equations:

$$\frac{\partial f}{\partial y_k} - \frac{d}{dx}\frac{\partial f}{\partial y_k'} = 0 \quad (k = 1, 2, \ldots, s) \tag{III.25}$$

The set of functions $y_k = y_k(x)$ satisfying these equations and the
boundary conditions (III.21) when substituted into the functional
(III.20) will give its extremum.
IV. Conics

Conics (conic sections) are defined to be the lines of intersection 
of a circular cone with a plane. Depending on the orientation of the 
plane relative to the axis of the cone, these lines are either an ellipse 
(circle), a hyperbola, or a parabola. 




An ellipse (Fig. IV.1) is defined to be the locus of points, the sum
of whose distances from two fixed points $F_1$ and $F_2$ called the foci is
a constant quantity:

$$r_1 + r_2 = 2a \tag{IV.1}$$

The canonical equation of an ellipse is

$$\frac{x^2}{a^2} + \frac{y^2}{b^2} = 1 \tag{IV.2}$$

where $a$ and $b$ are the major and minor semiaxes of the ellipse. The
quantity

$$e = \frac{c}{a} \tag{IV.3}$$

where $c$ is half the distance between the foci, is called the eccentricity
of the ellipse. When $e = 0$, an ellipse degenerates into a circle.
The quantities $a$, $b$ and $c$ are related by the expression

$$b^2 = a^2 - c^2 \tag{IV.4}$$

A hyperbola (Fig. IV.2) is defined to be the locus of points, the
magnitude of the difference of whose distances from two fixed points
$F_1$ and $F_2$ called the foci is a constant quantity:

$$|r_1 - r_2| = 2a \tag{IV.5}$$

A hyperbola has two symmetric branches. The canonical equation of
a hyperbola is as follows:

$$\frac{x^2}{a^2} - \frac{y^2}{b^2} = 1 \tag{IV.6}$$

where $a$ and $b$ are the real and imaginary semiaxes of the hyperbola.
The quantities $a$ and $b$ are related to $c$ (half the distance between the
foci) by the expression

$$b^2 = c^2 - a^2 \tag{IV.7}$$

The eccentricity of a hyperbola is determined by the same
formula (IV.3) as that of an ellipse. It can be seen that for an ellipse
$e < 1$, and for a hyperbola $e > 1$.




A parabola (Fig. IV.3) is defined to be the locus of points whose
distance $r$ from a fixed point $F$ (the focus) equals the distance $d$
from the fixed straight line $D$ called the directrix of the parabola:
$r = d$. The canonical equation of a parabola is

$$y^2 = 2px \tag{IV.8}$$

where $p$ is the parameter of the parabola, equal to the distance from
the focus to the directrix (the $x$-axis is directed along the axis of
symmetry of the parabola, and the origin of coordinates coincides
with the apex of the parabola). As we shall see below, the
eccentricity of a parabola should be taken equal to unity.

Any conic can be determined as the locus of points for which the
ratio between the distance $r$ (Fig. IV.4) to the point $F$ (called the
focus) and the distance $d$ to the straight line $D$ (called the directrix)
is a constant quantity $e$ (called the eccentricity of the curve):

$$\frac{r}{d} = e \tag{IV.9}$$

The relevant conic is obtained depending on the value of the
eccentricity $e$ (Table IV.1).
Table IV.1

Value of e     Kind of curve
< 1            Ellipse
1              Parabola
> 1            Hyperbola


An ellipse and a hyperbola each have two foci and two directrices.
The condition (IV.9) is observed for each of the foci and the directrix
corresponding to it. Figure IV.5 shows the foci and directrices of an
ellipse (a) and a hyperbola (b) (compare with Fig. IV.3 for a parabola).



Fig. IV. 5. 


The distance $p'$ from a focus to a directrix is called the parameter
of the relevant curve. The quantity

$$p = p'e \tag{IV.10}$$

is known as the focal parameter. It is a simple matter to see that it
equals half the chord passing through a focus and parallel to the
directrix (Fig. IV.4). For a parabola, $p = p'$.

The distance between the directrices of an ellipse (hyperbola)
equals $2(a/e)$, where $a$ is the major (real) semiaxis of the curve.

Let us write the equation of a conic in polar coordinates, placing
the origin of coordinates at one of the foci of the curve (Fig. IV.4).
In accordance with (IV.9)

$$\frac{r}{d} = \frac{r}{p' + r\cos\varphi} = e$$

whence

$$r = \frac{p}{1 - e\cos\varphi} \tag{IV.11}$$

[$p = p'e$; see (IV.10)].

Equation (IV.11) describes an ellipse (with the origin of
coordinates at the point $F_1$; Fig. IV.5a), the right-hand branch of a
hyperbola (with the origin of coordinates at the point $F_2$; Fig. IV.5b),
and a parabola. We must note that the focus $F_2$ is the inner one for
the right-hand branch of the hyperbola.
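
Equation (IV.11) can be checked against the other definitions of an ellipse. The sketch below assumes illustrative semiaxes a = 5 and b = 3, places the pole at the left focus, and verifies for several polar angles that the points satisfy (IV.1), (IV.9) and (IV.2). The position of the directrix at a distance a/e from the centre follows from the statement above that the directrices are 2(a/e) apart; the function and variable names are hypothetical.

```python
import math

a, b = 5.0, 3.0                 # illustrative semiaxes
c = math.sqrt(a * a - b * b)    # half the focal distance, from (IV.4)
e = c / a                       # eccentricity, from (IV.3)
p_prime = a / e - c             # distance from the focus F1 to its directrix
p = p_prime * e                 # focal parameter, from (IV.10)

def point_on_conic(phi):
    """Radius and Cartesian point for polar angle phi, pole at the left focus F1 = (-c, 0)."""
    r = p / (1.0 - e * math.cos(phi))   # equation (IV.11)
    return r, (-c + r * math.cos(phi), r * math.sin(phi))

checks = []
for k in range(12):
    phi = 2.0 * math.pi * k / 12
    r1, (x, y) = point_on_conic(phi)
    r2 = math.hypot(x - c, y)           # distance to the right focus F2 = (c, 0)
    d = x + a / e                       # distance to the left directrix x = -a/e
    checks.append((r1 + r2, r1 / d, x * x / (a * a) + y * y / (b * b)))

print(checks[0])
```

For every angle the three numbers should equal 2a, e and 1 respectively, to rounding accuracy.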




If we place the origin of coordinates at the right-hand focus of an
ellipse (point $F_2$ in Fig. IV.5a), a glance at Fig. IV.6 shows that the
equation of the ellipse is

$$\frac{r}{p' - r\cos\varphi} = e$$

or

$$r = \frac{p}{1 + e\cos\varphi} \tag{IV.12}$$

($p = p'e$). The same equation describes the left-hand branch of a
hyperbola (see Fig. IV.5b) provided that the origin of coordinates
is placed at the point $F_1$ (at the inner focus with respect to this branch),
and also a parabola that is the mirror (relative to $D$) image of the
parabola depicted in Fig. IV.3. The focus $F$ of such a parabola is to
the left of the directrix $D$.

Let us find the equation of one of the branches of a hyperbola
(say, the left one) provided that the origin of coordinates is at the
external focus relative to this branch (at the point $F_2$ for the left
branch; Fig. IV.5b). In this case (Fig. IV.7),

$$d' = r\cos(\pi - \varphi) - \frac{2a}{e} - p' = -r\cos\varphi - \frac{2a}{e} - p'$$

According to the definition (IV.5) of a hyperbola, $r - r' = 2a$,
whence

$$r' = r - 2a$$

Introducing the values $d'$ and $r'$ we have found into relation (IV.9),
we arrive at the formula

$$\frac{r'}{d'} = \frac{r - 2a}{-r\cos\varphi - \dfrac{2a}{e} - p'} = e$$

After simple transformations, we obtain the required equation

$$r = \frac{-p}{1 + e\cos\varphi} \tag{IV.13}$$

($p = p'e$). It must be borne in mind that for the left branch
$\varphi > \pi/2$ (see Fig. IV.7), i.e. $\cos\varphi < 0$. In addition, for all points
$|e\cos\varphi| > 1$ so that the values of $r$ obtained by formula (IV.13)
will be positive.

We invite our reader to convince himself that the similar equation
for the right branch (the origin of coordinates is at the point $F_1$) is

$$r = \frac{-p}{1 - e\cos\varphi} \tag{IV.14}$$

In this case, $\varphi < \pi/2$ so that $\cos\varphi > 0$. In addition, $e\cos\varphi > 1$
so that the values of $r$ are positive.


V. Linear Differential Equations with Constant
Coefficients

A linear differential equation of the $n$-th order with constant
coefficients is defined to be an equation of the kind

$$y^{(n)} + a_{n-1}y^{(n-1)} + \ldots + a_1 y' + a_0 y = f(x) \tag{V.1}$$

i.e. an equation linear in the unknown function $y(x)$ and its
derivatives (the $a$'s are constant quantities that may also be zero).

If the right-hand side of an equation identically equals zero
[$f(x) \equiv 0$], the linear equation is called homogeneous, otherwise it is
non-homogeneous. A homogeneous equation has the form

$$y^{(n)} + a_{n-1}y^{(n-1)} + \ldots + a_1 y' + a_0 y = 0 \tag{V.2}$$

The general solution of a differential equation is defined to be the
multitude of solutions including all the particular solutions with
no exceptions. The general solution of an $n$-th order differential
equation contains $n$ arbitrary constants (integration constants),
i.e. has the form

$$y = y(x, C_1, C_2, \ldots, C_n) \tag{V.3}$$

By giving the constants $C_1, C_2, \ldots, C_n$ definite values, we obtain
a particular solution. The latter contains no arbitrary constants.

It is proved in the theory of linear differential equations that if
$y_1, y_2, \ldots, y_n$ are linearly independent¹ solutions of the
homogeneous equation (V.2), the general solution of this equation can be
written as

$$y(x, C_1, C_2, \ldots, C_n) = \sum_{i=1}^{n} C_i y_i(x) \tag{V.4}$$

where $C_1, C_2, \ldots, C_n$ are arbitrary constants.

Let $\tilde{y}(x)$ be one of the particular solutions of the non-homogeneous
equation (V.1) and $y(x)$ be the general solution of the same equation.
If we introduce the notation $u(x) = y(x) - \tilde{y}(x)$, the general
solution can be written as follows:

$$y(x, C_1, C_2, \ldots, C_n) = u(x, C_1, C_2, \ldots, C_n) + \tilde{y}(x) \tag{V.5}$$

Let us substitute this function into Eq. (V.1) and group separately
the terms of the kind $a_k u^{(k)}$ and the terms of the kind $a_k \tilde{y}^{(k)}$:

$$[u^{(n)} + a_{n-1}u^{(n-1)} + \ldots + a_1 u' + a_0 u] + [\tilde{y}^{(n)} + a_{n-1}\tilde{y}^{(n-1)} + \ldots + a_1 \tilde{y}' + a_0 \tilde{y}] = f(x)$$

The function $\tilde{y}(x)$ is a particular solution of the equation.
Consequently, the expression in the second brackets equals the right-hand
side of the equation. It follows that the function $u(x, C_1, C_2, \ldots, C_n)$
satisfies the condition

$$u^{(n)} + a_{n-1}u^{(n-1)} + \ldots + a_1 u' + a_0 u = 0$$

Hence, $u$ is the general solution of the homogeneous equation (V.2)
corresponding to the non-homogeneous equation (V.1), i.e. having
the same coefficients $a_k$ as Eq. (V.1).

The result we have obtained can be formulated as follows: the
general solution of a linear non-homogeneous equation equals the sum
of the general solution of the corresponding homogeneous equation and
a particular solution of the non-homogeneous equation:

$$y\,(\text{gen., non-hom.}) = y\,(\text{gen., hom.}) + y\,(\text{part., non-hom.}) \tag{V.6}$$

¹ The set of functions $f_1, f_2, \ldots, f_n$ is called linearly independent if an
expression of the kind

$$\alpha_1 f_1 + \alpha_2 f_2 + \ldots + \alpha_n f_n = 0$$

is observed only provided that all the $\alpha_i$'s vanish.
Linear homogeneous differential equations with constant coefficients
are solved with the aid of the substitution

$$y(x) = e^{\lambda x} \tag{V.7}$$

where $\lambda$ is a constant. Differentiating this function $m$ times
($m = 1, 2, \ldots, n$), we obtain

$$y^{(m)} = \lambda^m e^{\lambda x} \tag{V.8}$$

Introducing the values of the function (V.7) and its derivatives
(V.8) into Eq. (V.2) and cancelling the non-zero factor $e^{\lambda x}$, we arrive
at what is called a characteristic equation:

$$\lambda^n + a_{n-1}\lambda^{n-1} + \ldots + a_1\lambda + a_0 = 0 \tag{V.9}$$

The roots of this equation are the values of $\lambda$ at which the function
(V.7) satisfies Eq. (V.2).

If all $n$ roots of the characteristic equation are different (multiple,
i.e. coinciding roots are absent), $n$ particular solutions of the kind
$e^{\lambda_i x}$ will be linearly independent. Consequently, in the absence of
multiple roots, the general solution of Eq. (V.2) is as follows:

$$y = \sum_{i=1}^{n} C_i e^{\lambda_i x} \tag{V.10}$$

($C_1, C_2, \ldots, C_n$ are arbitrary constants).
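
As a small illustration of the procedure for n = 2, the sketch below finds the roots of the characteristic equation of an arbitrarily chosen equation y'' + 3y' + 2y = 0 and checks by finite differences that the combination (V.10) satisfies it. The equation, the constants and the function names are assumptions made for the example.

```python
import cmath

def characteristic_roots(a1, a0):
    """Roots of lambda^2 + a1*lambda + a0 = 0, the characteristic equation (V.9) for n = 2."""
    disc = cmath.sqrt(a1 * a1 - 4.0 * a0)
    return (-a1 + disc) / 2.0, (-a1 - disc) / 2.0

def general_solution(x, a1, a0, c1, c2):
    """y = C1*exp(lambda1*x) + C2*exp(lambda2*x), valid when the two roots differ (V.10)."""
    lam1, lam2 = characteristic_roots(a1, a0)
    return c1 * cmath.exp(lam1 * x) + c2 * cmath.exp(lam2 * x)

def residual(x, a1, a0, c1, c2, h=1e-4):
    """Finite-difference estimate of y'' + a1*y' + a0*y at the point x."""
    y = lambda t: general_solution(t, a1, a0, c1, c2)
    d1 = (y(x + h) - y(x - h)) / (2.0 * h)
    d2 = (y(x + h) - 2.0 * y(x) + y(x - h)) / (h * h)
    return d2 + a1 * d1 + a0 * y(x)

# Illustrative equation y'' + 3y' + 2y = 0 with arbitrary constants C1, C2.
roots = characteristic_roots(3.0, 2.0)
res = residual(0.7, 3.0, 2.0, c1=1.5, c2=-0.4)
print(roots, abs(res))
```

The residual should be small, limited only by the finite-difference error.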

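It can be shown that when the characteristic equation (V.9) has
multiple roots, $p_\mu$ linearly independent particular solutions
corresponding to the root $\lambda_\mu$ of multiplicity $p_\mu$ must be taken in the form

$$e^{\lambda_\mu x}, \quad x e^{\lambda_\mu x}, \quad x^2 e^{\lambda_\mu x}, \quad \ldots, \quad x^{p_\mu - 1} e^{\lambda_\mu x}$$

so that the contribution to the general solution corresponding to
them equals the sum

$$\sum_{k=1}^{p_\mu} C_{\mu k}\,x^{k-1} e^{\lambda_\mu x}$$

Consequently, if the root $\lambda_1$ is of the multiplicity $p_1$, the root
$\lambda_2$ of the multiplicity $p_2$, ..., $\lambda_m$ of the multiplicity $p_m$ (here
$p_1 + p_2 + \ldots + p_m = n$), the general solution can be written as

$$y = \sum_{\mu=1}^{m} \sum_{k=1}^{p_\mu} C_{\mu k}\,x^{k-1} e^{\lambda_\mu x} \tag{V.11}$$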

Let the coefficients $a_k$ in Eq. (V.1) be real, and the function $f(x)$
be complex. Writing it in the form

$$f(x) = f_1(x) + i f_2(x)$$

we get the equation

$$y^{(n)} + a_{n-1}y^{(n-1)} + \ldots + a_1 y' + a_0 y = f_1(x) + i f_2(x) \tag{V.12}$$

We shall seek the solution of this equation in the form

$$y(x) = y_1(x) + i y_2(x)$$

Substitution into (V.12) yields

$$(y_1^{(n)} + a_{n-1}y_1^{(n-1)} + \ldots + a_1 y_1' + a_0 y_1) + i\,(y_2^{(n)} + a_{n-1}y_2^{(n-1)} + \ldots + a_1 y_2' + a_0 y_2) = f_1(x) + i f_2(x) \tag{V.13}$$

In complex numbers equal to one another, the real and imaginary
parts are equal to one another independently. Hence, Eq. (V.13)
breaks up into two independent equations of the form of (V.1). The
right-hand side of one of them contains the function $f_1(x)$, and the
function $y_1(x)$ is its solution. The right-hand side of the other
equation contains the function $f_2(x)$, and the function $y_2(x)$ is its
solution. This property of Eq. (V.13) is due to its linear nature. It
allows us to use the following procedure that sometimes considerably
facilitates calculations. Assume that the right-hand side of the
equation (V.1) we are solving is real. We add an arbitrary imaginary
function to it. After now finding the complex solution of the
equation obtained, we take its real part. It will be a solution of the initial
differential equation.
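
A minimal sketch of this procedure, under assumptions chosen only for illustration: the real equation y'' + 2*beta*y' + omega0^2*y = f0*cos(omega*x) is supplemented with the imaginary term i*f0*sin(omega*x), so that the right-hand side becomes f0*exp(i*omega*x). A complex solution of the form A*exp(i*omega*x) is then found algebraically, and its real part is checked against the original real equation by finite differences.

```python
import cmath
import math

beta, omega0, f0, omega = 0.3, 2.0, 1.0, 1.5   # illustrative parameters

# Complex amplitude of the trial solution y = A * exp(i*omega*x) for the
# right-hand side f0 * exp(i*omega*x), whose real part is f0 * cos(omega*x).
A = f0 / (omega0 ** 2 - omega ** 2 + 2j * beta * omega)

def y_real(x):
    """Real part of the complex solution: a particular solution of the real equation."""
    return (A * cmath.exp(1j * omega * x)).real

def residual(x, h=1e-4):
    """y'' + 2*beta*y' + omega0^2*y - f0*cos(omega*x), estimated by finite differences."""
    d1 = (y_real(x + h) - y_real(x - h)) / (2.0 * h)
    d2 = (y_real(x + h) - 2.0 * y_real(x) + y_real(x - h)) / (h * h)
    return d2 + 2.0 * beta * d1 + omega0 ** 2 * y_real(x) - f0 * math.cos(omega * x)

worst = max(abs(residual(0.1 * k)) for k in range(50))
print(abs(A), worst)
```

The largest residual over the sampled points should be small, which confirms that the real part of the complex solution satisfies the real equation.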

The following statement is obvious: if a linear homogeneous
equation (V.2) with real coefficients has the complex solution
$y(x) = y_1(x) + i y_2(x)$, each of the functions $y_1(x)$ and $y_2(x)$ separately
is a solution of this equation.

VI. Vectors 

1. Basic Definitions. Vectors are quantities defined by a numerical 
value (magnitude) and a direction and, in addition, are added geo- 
metrically (i.e. according to the triangle or parallelogram method). 
On a later page, we shall give a more general definition allowing us 
to extend the concept of a vector to an n-dimensional space. 

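The scalar product¹ of two vectors $\mathbf{a}$ and $\mathbf{b}$ is defined as the scalar
quantity

$$ \mathbf{a}\mathbf{b} = ab \cos(\mathbf{a}, \mathbf{b}) \tag{VI.1} $$

A scalar product is commutative ($\mathbf{a}\mathbf{b} = \mathbf{b}\mathbf{a}$) and distributive
[$\mathbf{a}(\mathbf{b}_1 + \mathbf{b}_2 + \ldots) = \mathbf{a}\mathbf{b}_1 + \mathbf{a}\mathbf{b}_2 + \ldots$], but is not associative
[$\mathbf{a}(\mathbf{b}\mathbf{c}) \neq (\mathbf{a}\mathbf{b})\mathbf{c}$].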

¹ Another way of writing a scalar product in addition to the one we use is
$\mathbf{a} \cdot \mathbf{b}$, which explains the name dot product sometimes used for it.

A vector product is defined as the vector¹

$$ [\mathbf{a}\mathbf{b}] = ab \sin(\mathbf{a}, \mathbf{b}) \cdot \mathbf{n} \tag{VI.2} $$

where $\mathbf{n}$ is the unit vector of a normal to the plane containing the
vectors $\mathbf{a}$ and $\mathbf{b}$, the sequence $\mathbf{a}$, $\mathbf{b}$, $\mathbf{n}$ forming a right-handed system.

A vector product is not commutative ($[\mathbf{a}\mathbf{b}] \neq [\mathbf{b}\mathbf{a}]$), is distributive
($[\mathbf{a}, (\mathbf{b}_1 + \mathbf{b}_2 + \ldots)] = [\mathbf{a}\mathbf{b}_1] + [\mathbf{a}\mathbf{b}_2] + \ldots$), and is not
associative ($[[\mathbf{a}\mathbf{b}], \mathbf{c}] \neq [\mathbf{a}, [\mathbf{b}\mathbf{c}]]$).

Let us consider a scalar triple product of three vectors: $\mathbf{a}[\mathbf{b}\mathbf{c}]$.
Applying formulas (VI.1) and (VI.2), we obtain

$$ \mathbf{a}[\mathbf{b}\mathbf{c}] = a \{ bc \sin(\mathbf{b}, \mathbf{c}) \} \cos(\mathbf{a}, \mathbf{n}) $$

[Fig. VI.1]

A glance at Fig. VI.1 shows that the expression we have obtained
equals the volume of the parallelepiped constructed on the vectors
being multiplied². Indeed, $bc \sin(\mathbf{b}, \mathbf{c})$ gives the area of
the base of the parallelepiped, and $a \cos(\mathbf{a}, \mathbf{n})$ gives its altitude.
We can also take a face whose sides form the vectors $\mathbf{c}$ and $\mathbf{a}$ or
the vectors $\mathbf{a}$ and $\mathbf{b}$ as the base of the parallelepiped. In this case,
the volume will be determined by the scalar triple products $\mathbf{b}[\mathbf{c}\mathbf{a}]$
and $\mathbf{c}[\mathbf{a}\mathbf{b}]$. Since the volume in all three cases is the same, we can
write

$$ \mathbf{a}[\mathbf{b}\mathbf{c}] = \mathbf{b}[\mathbf{c}\mathbf{a}] = \mathbf{c}[\mathbf{a}\mathbf{b}] \tag{VI.3} $$

A scalar triple product thus allows a cyclic transposition of the 
factors, i.e. the substitution for each factor of the one following or 
preceding it. The vector c is presumed to be followed by a, and a 
is assumed to be preceded by c, which can be illustrated by the fol- 
lowing diagram: 

    a → b
     ↖ ↙          (VI.4)
      c

We must note that in all three expressions of formula (VI. 3) the 
vectors have the same sequence as in the diagram (VI. 4). If we take 
the vectors in a sequence that is the opposite of what is shown in 
the diagram (VI. 4), the scalar triple product will change its sign. 
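
The cyclic property (VI.3) and the change of sign for the opposite sequence are easy to check numerically. The sketch below is an illustration only: the helper names `dot`, `cross` and `triple` and the sample vectors are assumptions, and the vector product is computed from its Cartesian components, an expression the text derives later as formula (VI.30).

```python
def dot(a, b):
    """Scalar product in Cartesian components."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Vector product in Cartesian components, cf. (VI.30)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def triple(a, b, c):
    """Scalar triple product a[bc]."""
    return dot(a, cross(b, c))


a, b, c = (1, 2, 3), (0, 1, 4), (5, 6, 0)

# Cyclic transpositions a -> b -> c -> a leave the product unchanged.
cyclic = (triple(a, b, c), triple(b, c, a), triple(c, a, b))
# The opposite sequence reverses the sign.
reversed_order = triple(a, c, b)

if __name__ == "__main__":
    print(cyclic, reversed_order)
```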

A vector triple product is defined to be the vector $[\mathbf{a}, [\mathbf{b}\mathbf{c}]]$. We
can prove (this will be done somewhat later) that

$$ [\mathbf{a}, [\mathbf{b}\mathbf{c}]] = \mathbf{b}(\mathbf{a}\mathbf{c}) - \mathbf{c}(\mathbf{a}\mathbf{b}) \tag{VI.5} $$

¹ Another way of writing a vector product is $\mathbf{a} \times \mathbf{b}$, which explains the
name cross product sometimes used for it.

² The angle $(\mathbf{a}, \mathbf{n})$ is assumed to be acute. If this angle is obtuse, the scalar
triple product equals the volume of the parallelepiped taken with a minus sign.

In accordance with the definitions (VI.2) and (VI.1), the square of
the vector product of the vectors $\mathbf{a}$ and $\mathbf{b}$ can be transformed as
follows:

$$ [\mathbf{a}\mathbf{b}]^2 = a^2 b^2 \sin^2(\mathbf{a}, \mathbf{b}) = a^2 b^2 - a^2 b^2 \cos^2(\mathbf{a}, \mathbf{b}) = a^2 b^2 - (\mathbf{a}\mathbf{b})^2 $$

We have arrived at the formula

$$ [\mathbf{a}\mathbf{b}]^2 = a^2 b^2 - (\mathbf{a}\mathbf{b})^2 \tag{VI.6} $$

2. Formulas of Vector Algebra Expressed in Terms of Projections 
of Vectors onto the Coordinate Axes. All the above definitions and 
formulas do not depend on the choice of the coordinate system used 
for our consideration. If we set up a coordinate system (we shall 
consider only rectangular, i.e. Cartesian systems), each vector can 
be set by three numbers, its projections onto the coordinate axes¹.
Consequently, the vector $\mathbf{a}$ is equivalent to the three numbers $a_x$,
$a_y$, $a_z$, the vector $\mathbf{b}$ to the numbers $b_x$, $b_y$, $b_z$, etc.

Knowing the projections of a vector onto the coordinate axes, we
can find the vector itself. Denoting the unit vectors of the coordinate
axes by the symbols $\mathbf{e}_x$, $\mathbf{e}_y$, $\mathbf{e}_z$², we can represent the vector as

$$ \mathbf{a} = \mathbf{e}_x a_x + \mathbf{e}_y a_y + \mathbf{e}_z a_z \tag{VI.7} $$

To write the formulas in a compact form using the summation
sign $\sum$, we shall use the symbol $x_1$ instead of the
coordinate $x$, $x_2$ instead of $y$, and $x_3$ instead of $z$ in the following.
Similarly, we shall introduce the symbols $\mathbf{e}_1$, $\mathbf{e}_2$, $\mathbf{e}_3$ for the unit
vectors of the axes. The correspondence between the previous and
the new symbols is shown below:

$$ x \to x_1, \quad y \to x_2, \quad z \to x_3 \tag{VI.8} $$

$$ \mathbf{i} = \mathbf{e}_x \to \mathbf{e}_1, \quad \mathbf{j} = \mathbf{e}_y \to \mathbf{e}_2, \quad \mathbf{k} = \mathbf{e}_z \to \mathbf{e}_3 \tag{VI.9} $$

In the new notation, formula (VI.7) can be written as

$$ \mathbf{a} = \sum_k \mathbf{e}_k a_k \tag{VI.10} $$

(unless otherwise indicated, we shall always assume that the subscript
over which summation is performed, the dummy index, runs
through the values 1, 2, 3).

¹ We are treating free vectors, for which the point of application of the
vector and the straight line along which it is directed are not fixed.

² These unit vectors are also designated by the symbols $\mathbf{i}$, $\mathbf{j}$, $\mathbf{k}$. The notation
we have adopted, however, as will be seen from the following treatment, has an
undoubted advantage.

The triplet of vectors $\mathbf{e}_1$, $\mathbf{e}_2$, $\mathbf{e}_3$ forms the basis of a coordinate
system. The setting of these vectors completely determines the system.
Since $\mathbf{e}_1$, $\mathbf{e}_2$ and $\mathbf{e}_3$ are mutually perpendicular, and their magnitudes
equal unity, it can be seen from formula (VI.1) that

$$ \mathbf{e}_i \mathbf{e}_k = \delta_{ik} \tag{VI.11} $$

where $\delta_{ik}$ is the Kronecker symbol determined as follows:

$$ \delta_{ik} = \begin{cases} 1 & \text{when } i = k \\ 0 & \text{when } i \neq k \end{cases} \tag{VI.12} $$

It must be noted that $\delta_{ik} = \delta_{ki}$ (it will be shown in Appendix X
that the set of the quantities $\delta_{ik}$ forms a symmetric second-rank
tensor).


Examination of formula (VI.2) reveals that (Fig. VI.2)

$$ [\mathbf{e}_1 \mathbf{e}_2] = \mathbf{e}_3, \quad [\mathbf{e}_2 \mathbf{e}_3] = \mathbf{e}_1, \quad [\mathbf{e}_3 \mathbf{e}_1] = \mathbf{e}_2 \tag{VI.13} $$

Each of the relations (VI.13) can be obtained from the preceding
(or following) one by a cyclic transposition of the subscripts
according to the diagram

    1 → 2
     ↖ ↙          (VI.14)
      3

Let us introduce the symbol¹

$$ \varepsilon_{ikl} \tag{VI.15} $$

standing for a set of 27 numbers that are determined by the
following rules:

(1) if the values of at least two subscripts coincide, we have
$\varepsilon_{ikl} = 0$ (for instance, $\varepsilon_{311} = \varepsilon_{212} = \varepsilon_{133} = \varepsilon_{222} = 0$);

(2) if all the subscripts are different and form a cyclic transposition
of the sequence 1, 2, 3, we have $\varepsilon_{ikl} = 1$
($\varepsilon_{123} = \varepsilon_{231} = \varepsilon_{312} = 1$);

(3) if all the subscripts are different and form a cyclic transposition
of the sequence 3, 2, 1, we have $\varepsilon_{ikl} = -1$
($\varepsilon_{321} = \varepsilon_{213} = \varepsilon_{132} = -1$).

¹ It is sometimes called the Kronecker skew-symmetric symbol. It will be
shown in Appendix X that the set of quantities $\varepsilon_{ikl}$ forms an absolutely
antisymmetric third-rank tensor.

Hence, of the 27 values of $\varepsilon_{ikl}$, 21 are zero, 3 are +1, and 3 are −1.

We must note that any cyclic transposition of the numbers 1, 2, 3 
can be obtained from 1, 2, 3 by an even number of transpositions 
of two subscripts, and any cyclic transposition of the numbers 3, 2, 1 
can be obtained from 1, 2, 3 by an odd number of transpositions of 
two subscripts. Indeed, by changing, for instance, the places of 1
and 2 in the sequence 1, 2, 3 (which yields 2, 1, 3, i.e. a cyclic
transposition of the numbers 3, 2, 1), and then changing the places
of 1 and 3, we obtain the transposition 2, 3, 1.

Consequently, the values of the symbol $\varepsilon_{ikl}$ can be determined as
follows: (1) they are zero if the values of at least two subscripts
coincide, (2) they are +1 or −1 depending on whether the sequence $i$, $k$,
$l$ can be obtained from the sequence 1, 2, 3 by an even or an odd
number of transpositions.

We can also use the following rule to determine the sign of $\varepsilon_{ikl}$.
When a larger number is ahead of a smaller one in a transposition,
we shall call this a disorder. For example, in the transposition 2, 1, 3
there is one disorder (2 is ahead of 1), and in the transposition 3, 2,
1 there are three disorders (3 is ahead of 1, 3 is ahead of 2, and 2 is
ahead of 1). Let us assign the value of +1 to $\varepsilon_{ikl}$ if the number of
disorders in the transposition $i$, $k$, $l$ is even, and −1 if it is odd. It
is easy to see that all three rules for determining the sign of $\varepsilon_{ikl}$
which we have considered give the same result.
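
The agreement of the three rules can be confirmed by direct enumeration of all 27 sets of subscripts. The following sketch is an illustration only; the function names `eps_cyclic`, `eps_swaps` and `eps_disorders` are assumptions introduced here, each implementing one of the rules stated above.

```python
from itertools import product


def eps_cyclic(i, k, l):
    """Rules (1)-(3): cyclic transpositions of 1,2,3 give +1, of 3,2,1 give -1."""
    if (i, k, l) in {(1, 2, 3), (2, 3, 1), (3, 1, 2)}:
        return 1
    if (i, k, l) in {(3, 2, 1), (2, 1, 3), (1, 3, 2)}:
        return -1
    return 0


def eps_swaps(i, k, l):
    """Parity rule: bring the sequence to 1,2,3 by swapping pairs and count the swaps."""
    if len({i, k, l}) < 3:
        return 0
    seq, swaps = [i, k, l], 0
    for p in range(3):
        while seq[p] != p + 1:
            target = seq[p] - 1          # position where the value seq[p] belongs
            seq[p], seq[target] = seq[target], seq[p]
            swaps += 1
    return 1 if swaps % 2 == 0 else -1


def eps_disorders(i, k, l):
    """Disorder rule: count pairs in which a larger number is ahead of a smaller one."""
    if len({i, k, l}) < 3:
        return 0
    seq = (i, k, l)
    disorders = sum(1 for p in range(3) for q in range(p + 1, 3) if seq[p] > seq[q])
    return 1 if disorders % 2 == 0 else -1


all_indices = list(product((1, 2, 3), repeat=3))
values = [eps_cyclic(*idx) for idx in all_indices]

if __name__ == "__main__":
    # Number of zero, +1 and -1 values among the 27 components.
    print(values.count(0), values.count(1), values.count(-1))
```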

Let us prove the following very useful relation between the symbols
$\varepsilon$ and $\delta$:

$$ \sum_l \varepsilon_{ikl} \varepsilon_{mnl} = \delta_{im} \delta_{kn} - \delta_{in} \delta_{km} \tag{VI.16} $$

We shall expand the sum on the left-hand side:

$$ \varepsilon_{ik1} \varepsilon_{mn1} + \varepsilon_{ik2} \varepsilon_{mn2} + \varepsilon_{ik3} \varepsilon_{mn3} \tag{VI.17} $$

and determine the values of the subscripts $i$, $k$, $m$, and $n$ at which
this sum is non-zero. It is evident that for at least one term to differ
from zero, the conditions

$$ i \neq k \quad \text{and} \quad m \neq n \tag{VI.18} $$

must be satisfied simultaneously. In addition, it is essential that

$$ i = m,\ k = n \quad \text{or} \quad i = n,\ k = m \tag{VI.19} $$

Indeed, if the condition (VI.18) is observed but the condition (VI.19)
is not, all three numbers 1, 2
and 3 will be present among the values of the first two subscripts
of both factors in each of the terms in (VI.17). Therefore, the dummy
index $l$ in each of the terms will coincide with the value of one of
the subscripts $i$, $k$, $m$ and $n$, so that all the terms will vanish.

Let us combine the conditions (VI.18) and (VI.19) into one
expressed by the formulas

$$ i = m \neq k = n \tag{VI.20} $$

$$ i = n \neq k = m \tag{VI.21} $$

In the case corresponding to relations (VI.20), the sum (VI.17)
becomes

$$ \varepsilon_{mn1} \varepsilon_{mn1} + \varepsilon_{mn2} \varepsilon_{mn2} + \varepsilon_{mn3} \varepsilon_{mn3} $$

It is obvious that only one term (for which $m$, $n$, and $l$ are different)
will be non-zero, and it equals +1.

In the case corresponding to relations (VI.21), the sum (VI.17)
becomes

$$ \varepsilon_{nm1} \varepsilon_{mn1} + \varepsilon_{nm2} \varepsilon_{mn2} + \varepsilon_{nm3} \varepsilon_{mn3} $$

In this sum too, only one term is non-zero, and it equals the product
of +1 and −1, i.e. −1 (upon the transposition of two subscripts
$\varepsilon_{nml}$ changes its sign).

Now let us turn to the right-hand side of formula (VI.16), i.e. to
the expression

$$ \delta_{im} \delta_{kn} - \delta_{in} \delta_{km} \tag{VI.22} $$

If $i = k$ (or $m = n$), this expression becomes $\delta_{km} \delta_{kn} - \delta_{kn} \delta_{km}$ (or
$\delta_{in} \delta_{kn} - \delta_{in} \delta_{kn}$). Both these expressions are zero. It thus follows
that for expression (VI.22) to be other than zero, the following
conditions must be observed simultaneously:

$$ i \neq k \quad \text{and} \quad m \neq n \tag{VI.23} $$

[compare with (VI.18)]. In addition, one of the following two
conditions must be observed:

$$ i = m,\ k = n \tag{VI.24} $$

$$ i = n,\ k = m \tag{VI.25} $$

When the condition (VI.24) is satisfied, the first term of expression
(VI.22) is +1. Since $i = m$ and $m \neq n$ [see (VI.23)], we have
$i \neq n$, and the second term of expression (VI.22) vanishes.
Consequently, when the condition (VI.24) is observed, expression (VI.22)
is +1. The combination of the conditions (VI.23) and (VI.24) is
equivalent to the condition (VI.20) in which, as we have established,
the left-hand side of formula (VI.16) also becomes equal to +1.

When the condition (VI.25) is observed, which in combination
with (VI.23) is equivalent to the condition (VI.21), expression
(VI.22) becomes equal to −1.

We have thus proved relation (VI. 16). 
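
Since the subscripts $i$, $k$, $m$, $n$ take only three values each, relation (VI.16) can also be confirmed by checking all 81 combinations. The sketch below is an illustration only. The names `delta`, `eps` and `identity_holds` are assumptions, and `eps` uses a compact closed-form expression that reproduces rules (1)-(3) for subscripts 1, 2, 3 rather than anything given in the text.

```python
from itertools import product


def delta(i, k):
    """Kronecker symbol (VI.12)."""
    return 1 if i == k else 0


def eps(i, k, l):
    """Symbol eps_ikl for subscripts in {1, 2, 3}.

    The product (i-k)(k-l)(l-i) is 0 when two subscripts coincide and +2 or -2
    otherwise, so halving it gives 0, +1 or -1 in agreement with rules (1)-(3).
    """
    return (i - k) * (k - l) * (l - i) // 2


def identity_holds(i, k, m, n):
    """Compare both sides of (VI.16) for one set of free subscripts."""
    lhs = sum(eps(i, k, l) * eps(m, n, l) for l in (1, 2, 3))
    rhs = delta(i, m) * delta(k, n) - delta(i, n) * delta(k, m)
    return lhs == rhs


all_ok = all(identity_holds(*idx) for idx in product((1, 2, 3), repeat=4))

if __name__ == "__main__":
    print(all_ok)
```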


Using the symbol $\varepsilon_{ikl}$, we can write the combination of relations
(VI.13) as a single expression:

$$ [\mathbf{e}_i \mathbf{e}_k] = \sum_l \varepsilon_{ikl} \mathbf{e}_l \tag{VI.26} $$

Indeed, when $i = 1$ and $k = 2$, only the addend $\varepsilon_{123} \mathbf{e}_3$ equal to $\mathbf{e}_3$
will be non-zero, when $i = 2$ and $k = 3$, only the addend $\varepsilon_{231} \mathbf{e}_1 = \mathbf{e}_1$
will be non-zero, and, finally, when $i = 3$ and $k = 1$, the addend
$\varepsilon_{312} \mathbf{e}_2 = \mathbf{e}_2$ will be non-zero.

The expression (VI.26) gives even more than a combination of
three relations (VI.13). It contains nine relations. It follows from
it that the vector product of any unit vector by itself is zero: when
$i = k$, all the addends on the right-hand side of formula (VI.26)
vanish. In addition, (VI.26) contains expressions obtained from
(VI.13) by transposition of the multipliers. For instance, when
$i = 2$ and $k = 1$, the addend $\varepsilon_{213} \mathbf{e}_3 = -\mathbf{e}_3$ on the right in (VI.26)
is non-zero, etc.

Let us form the scalar product of the vectors $\mathbf{a} = \sum_i \mathbf{e}_i a_i$ and
$\mathbf{b} = \sum_k \mathbf{e}_k b_k$:

$$ \mathbf{a}\mathbf{b} = \Big( \sum_i \mathbf{e}_i a_i \Big) \Big( \sum_k \mathbf{e}_k b_k \Big) $$

Using the property of distributivity, we can write

$$ \mathbf{a}\mathbf{b} = \sum_{i,k} \mathbf{e}_i \mathbf{e}_k a_i b_k = \sum_{i,k} \delta_{ik} a_i b_k $$

[see formula (VI.11)]. In accordance with the definition of $\delta_{ik}$, in
the last sum only addends having identical values of the subscripts
$i$ and $k$ are non-zero. Therefore

$$ \mathbf{a}\mathbf{b} = \sum_i a_i b_i \tag{VI.27} $$

or, going over to conventional symbols, we have

$$ \mathbf{a}\mathbf{b} = a_x b_x + a_y b_y + a_z b_z \tag{VI.28} $$

For the given vectors $\mathbf{a}$ and $\mathbf{b}$, their projections onto the coordinate
axes depend on the choice of the coordinate system, but the
product $\mathbf{a}\mathbf{b}$ itself does not depend on this choice. We thus conclude
that the expression $a_x b_x + a_y b_y + a_z b_z$ is an invariant, i.e. a quantity
identical in all coordinate systems.

Assume that we have been given a set of three numbers $u$, $v$, $w$
about which we know that in combination with the projections of a
vector $\mathbf{a}$ they yield a scalar, i.e. an invariant:

$$ u a_x + v a_y + w a_z = \text{inv} $$

On the basis of what has been said above, we can assert that $u$, $v$,
and $w$ are the components¹ of a vector.

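Let us form the vector product of the vectors $\mathbf{a} = \sum_i \mathbf{e}_i a_i$ and
$\mathbf{b} = \sum_k \mathbf{e}_k b_k$:

$$ [\mathbf{a}\mathbf{b}] = \Big[ \Big( \sum_i \mathbf{e}_i a_i \Big), \Big( \sum_k \mathbf{e}_k b_k \Big) \Big] $$

Owing to distributivity, we can write

$$ [\mathbf{a}\mathbf{b}] = \sum_{i,k} [\mathbf{e}_i \mathbf{e}_k] a_i b_k $$

Let us replace $[\mathbf{e}_i \mathbf{e}_k]$ in accordance with formula (VI.26):

$$ [\mathbf{a}\mathbf{b}] = \sum_{i,k} a_i b_k \sum_l \varepsilon_{ikl} \mathbf{e}_l = \sum_{i,k,l} \varepsilon_{ikl} a_i b_k \mathbf{e}_l $$

The vector product can thus be written as

$$ [\mathbf{a}\mathbf{b}] = \sum_{i,k,l} \varepsilon_{ikl} a_i b_k \mathbf{e}_l \tag{VI.29} $$

Of the 27 addends of this sum, only six are non-zero. Writing them
out, we obtain

$$ [\mathbf{a}\mathbf{b}] = a_1 b_2 \mathbf{e}_3 + a_2 b_3 \mathbf{e}_1 + a_3 b_1 \mathbf{e}_2 - a_3 b_2 \mathbf{e}_1 - a_2 b_1 \mathbf{e}_3 - a_1 b_3 \mathbf{e}_2 $$

Finally, combining terms with identical unit vectors, we arrive at
the expression

$$ [\mathbf{a}\mathbf{b}] = \mathbf{e}_1 (a_2 b_3 - a_3 b_2) + \mathbf{e}_2 (a_3 b_1 - a_1 b_3) + \mathbf{e}_3 (a_1 b_2 - a_2 b_1) \tag{VI.30} $$

that can be written in the form of a determinant (see Appendix VIII)

$$ [\mathbf{a}\mathbf{b}] = \begin{vmatrix} \mathbf{e}_1 & \mathbf{e}_2 & \mathbf{e}_3 \\ a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \end{vmatrix} \tag{VI.31} $$

or in conventional notation

$$ [\mathbf{a}\mathbf{b}] = \begin{vmatrix} \mathbf{e}_x & \mathbf{e}_y & \mathbf{e}_z \\ a_x & a_y & a_z \\ b_x & b_y & b_z \end{vmatrix} \tag{VI.32} $$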

We must note that according to (VI.29), the $l$-th component of a
vector product is determined by the formula

$$ [\mathbf{a}\mathbf{b}]_l = \sum_{i,k} \varepsilon_{ikl} a_i b_k = \sum_{i,k} \varepsilon_{lik} a_i b_k $$

(we have performed a cyclic transposition of the subscripts on $\varepsilon$,
which, as is known, does not change the numerical value of this
symbol). To deal with the more customary sequence of letter
subscripts, we shall write an expression for the $i$-th component of a
vector product:

$$ [\mathbf{a}\mathbf{b}]_i = \sum_{k,l} \varepsilon_{ikl} a_k b_l \tag{VI.33} $$

¹ For brevity's sake, we shall use this term for the projections of a vector
onto the coordinate axes.
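
Formula (VI.33) translates directly into a double sum over $k$ and $l$. The sketch below is an illustration only: the names `eps`, `cross_eps` and `cross_expanded` are assumptions, and the closed-form expression used for `eps` is a convenient shortcut, not something taken from the text. It compares the sum over $\varepsilon_{ikl}$ with the expanded expression (VI.30).

```python
def eps(i, k, l):
    """Symbol eps_ikl for subscripts in {1, 2, 3} (closed-form shortcut)."""
    return (i - k) * (k - l) * (l - i) // 2


def cross_eps(a, b):
    """[ab]_i = sum over k, l of eps_ikl a_k b_l, formula (VI.33).

    Subscripts run over 1, 2, 3; the tuples a and b are indexed from 0.
    """
    return tuple(
        sum(eps(i, k, l) * a[k - 1] * b[l - 1]
            for k in (1, 2, 3) for l in (1, 2, 3))
        for i in (1, 2, 3)
    )


def cross_expanded(a, b):
    """Components read off from the expanded expression (VI.30)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


a, b = (1, 2, 3), (4, 5, 6)

if __name__ == "__main__":
    print(cross_eps(a, b), cross_expanded(a, b))
```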

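Let us prove formula (VI.5) using relation (VI.16). For this purpose,
we shall write the vector product in accordance with formula (VI.29):

$$ \mathbf{d} = [\mathbf{a}, [\mathbf{b}\mathbf{c}]] = \sum_{k,l,i} \varepsilon_{kli}\, a_k\, [\mathbf{b}\mathbf{c}]_{\text{pr},l}\, \mathbf{e}_i $$

Now, we shall substitute for $[\mathbf{b}\mathbf{c}]_{\text{pr},l}$ its expression obtained from
formula (VI.33):

$$ \mathbf{d} = \sum_{k,l,i} \varepsilon_{kli}\, a_k\, \mathbf{e}_i \sum_{m,n} \varepsilon_{lmn} b_m c_n $$

Let us perform a cyclic transposition of the subscripts on the symbols
$\varepsilon$ so that their common subscript $l$ will be in the last place. In
addition, let us group the factors so that summation over $l$ is
performed first of all:

$$ \mathbf{d} = \sum_{i,k,m,n} \mathbf{e}_i a_k b_m c_n \sum_l \varepsilon_{ikl} \varepsilon_{mnl} = \sum_{i,k,m,n} \mathbf{e}_i a_k b_m c_n (\delta_{im} \delta_{kn} - \delta_{in} \delta_{km}) $$

[we have employed relation (VI.16)]. Further transformations yield

$$ \mathbf{d} = \sum_{i,k} \mathbf{e}_i a_k \sum_m \delta_{im} b_m \sum_n \delta_{kn} c_n - \sum_{i,k} \mathbf{e}_i a_k \sum_n \delta_{in} c_n \sum_m \delta_{km} b_m $$

$$ = \sum_{i,k} \mathbf{e}_i a_k b_i c_k - \sum_{i,k} \mathbf{e}_i a_k c_i b_k $$

$$ = \sum_i \mathbf{e}_i b_i \sum_k a_k c_k - \sum_i \mathbf{e}_i c_i \sum_k a_k b_k = \mathbf{b}(\mathbf{a}\mathbf{c}) - \mathbf{c}(\mathbf{a}\mathbf{b}) $$

Q.E.D.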
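
A numerical check of formula (VI.5) for one set of vectors is a useful safeguard against sign errors in such index manipulations. The sketch below is an illustration only; the helper names and the sample vectors are assumptions.

```python
def dot(a, b):
    """Scalar product, formula (VI.27)."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Vector product in components, formula (VI.30)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def triple_vector_product(a, b, c):
    """Left-hand side of (VI.5): [a, [bc]]."""
    return cross(a, cross(b, c))


def bac_minus_cab(a, b, c):
    """Right-hand side of (VI.5): b(ac) - c(ab)."""
    ac, ab = dot(a, c), dot(a, b)
    return tuple(bi * ac - ci * ab for bi, ci in zip(b, c))


a, b, c = (1, 2, 3), (4, 5, 6), (7, 8, 10)

if __name__ == "__main__":
    print(triple_vector_product(a, b, c), bac_minus_cab(a, b, c))
```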

3. True Vectors and Pseudovectors. Two kinds of vectors are
distinguished: polar (or true) vectors and axial vectors, also known as
pseudovectors¹. Upon inversion of the coordinate axes, i.e. when the
directions of the coordinate axes are reversed (Fig. VI.3), the
components of a true vector change their sign. This signifies that such a
vector upon inversion remains unchanged. The components of a
pseudovector upon inversion do not change their sign. This signifies that
a pseudovector upon inversion reverses its direction (i.e. changes
its sign).

Inspection of Fig. VI.3 shows that upon inversion of the coordinate
axes, a right-hand system of coordinates transforms into a left-hand
one. The distinction between a true vector and a pseudovector
can therefore be defined as follows: a true (polar) vector does not
change upon a transition from a right-hand system of coordinates to
a left-hand one, whereas a pseudovector reverses its direction in
such a transition. It will be shown in Appendix X that a pseudovector
is an antisymmetric second-rank tensor.

¹ "Pseudo" is a prefix meaning false or sham.

If both vectors $\mathbf{a}$ and $\mathbf{b}$ are true, the components of the vector
product (VI.30) upon inversion do not change their sign ($a_i$ and $b_k$
separately change their sign, but their product remains unchanged).
Consequently, the vector product of true vectors is a pseudovector.

[Fig. VI.3]

Scalars must also be divided into two kinds: true scalars and
pseudoscalars. True scalars do not change in a transition from a
right-hand coordinate system to a left-hand one (or upon inversion of the
coordinate axes). They include mass, electric charge, and temperature.
Pseudoscalars change their sign in a transition from a right-hand
coordinate system to a left-hand one. They include the scalar
expressions obtained as a result of mathematical operations on vectors.
For example, the scalar product [see (VI.27)] of a true vector and a
pseudovector changes its sign upon inversion and, consequently, is not
a true scalar, but a pseudoscalar.

If the vectors $\mathbf{a}$, $\mathbf{b}$, $\mathbf{c}$ are true, expression (VI.3) will be a
pseudoscalar: it changes its sign upon inversion. Hence, the scalar triple
product of true vectors is a pseudoscalar.

4. Transformations of Vector Components. Let us find formulas
for the transformation of the components of a vector in a transition
from one coordinate system to another. Let us take two systems of
Cartesian coordinates $K$ and $K'$, setting them by their unit vectors
$\mathbf{e}_1$, $\mathbf{e}_2$, $\mathbf{e}_3$ and $\mathbf{e}'_1$, $\mathbf{e}'_2$, $\mathbf{e}'_3$. The arbitrary vector $\mathbf{a}$ can be written as
$\mathbf{a} = \sum_k \mathbf{e}_k a_k$, where $a_k$ are the projections of $\mathbf{a}$ onto the axes of the
system $K$, or as $\mathbf{a} = \sum_k \mathbf{e}'_k a'_k$, where $a'_k$ are the projections of $\mathbf{a}$ onto
the axes of the system $K'$. Hence,

$$ \sum_k \mathbf{e}'_k a'_k = \sum_k \mathbf{e}_k a_k \tag{VI.34} $$

We multiply (VI.34) by the unit vector $\mathbf{e}'_i$:

$$ \sum_k \mathbf{e}'_i \mathbf{e}'_k a'_k = \sum_k \mathbf{e}'_i \mathbf{e}_k a_k \tag{VI.35} $$

By (VI.11), we have $\mathbf{e}'_i \mathbf{e}'_k = \delta_{ik}$. Consequently, of the three addends
on the left-hand side, only the one with $k = i$, equal to
$\delta_{ii} a'_i = a'_i$, will be non-zero.

The scalar product $\mathbf{e}'_i \mathbf{e}_k$ equals the cosine of the angle between the
axis $x'_i$ of the system $K'$ and the axis $x_k$ of the system $K$. Designating
this cosine by the symbol $\alpha_{ik}$, we can write

$$ \alpha_{ik} = \mathbf{e}'_i \mathbf{e}_k = \cos(x'_i, x_k) \quad (i, k = 1, 2, 3) \tag{VI.36} $$

Using this notation, we can write relation (VI.35) as

$$ a'_i = \sum_k \alpha_{ik} a_k \quad (i = 1, 2, 3) \tag{VI.37} $$

Formula (VI.37) allows us to calculate the projections of the vector
$\mathbf{a}$ onto the axes of the system $K'$ according to the known projections
of $\mathbf{a}$ onto the axes of the system $K$. To obtain formulas for the reverse
transformation (from $K'$ to $K$), let us multiply (VI.34) by the unit
vector $\mathbf{e}_i$. Repeating the reasoning that led us to formula (VI.37),
we obtain

$$ a_i = \sum_k \alpha_{ki} a'_k \quad (i = 1, 2, 3) \tag{VI.38} $$

Formulas (VI.37) and (VI.38) differ only in that in one case summation
is performed over the second subscript of $\alpha_{ik}$, and in the other
case over the first one.

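The nine quantities $\alpha_{ik}$ are not independent. Let us form the sum
$\sum_m \alpha_{im} \alpha_{km}$. Taking (VI.36) into account, we obtain

$$ \sum_m \alpha_{im} \alpha_{km} = \sum_m (\mathbf{e}'_i \mathbf{e}_m)(\mathbf{e}'_k \mathbf{e}_m) $$

The quantity $\mathbf{e}'_i \mathbf{e}_m$ can be considered as the projection of the vector
$\mathbf{e}'_i$ onto the axis $x_m$ of the system $K$; similarly, $\mathbf{e}'_k \mathbf{e}_m$ is the projection
of the vector $\mathbf{e}'_k$ onto the axis $x_m$. Hence, the sum on the right can be
written as

$$ \sum_m (\mathbf{e}'_i)_{\text{pr}.\,x_m} (\mathbf{e}'_k)_{\text{pr}.\,x_m} = \mathbf{e}'_i \mathbf{e}'_k = \delta_{ik} $$

[see formula (VI.27)]. Consequently,

$$ \sum_m \alpha_{im} \alpha_{km} = \delta_{ik} \tag{VI.39} $$

It can be proved in a similar way (we invite our reader to do this)
that

$$ \sum_m \alpha_{mi} \alpha_{mk} = \delta_{ik} \tag{VI.40} $$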

Transformations (VI.37) and (VI.38) can be adopted as a definition
of a vector: a vector is defined to be a set of quantities $a_1$, $a_2$, $a_3$,
which upon a transition from one coordinate system to another
are transformed by formulas (VI.37) and (VI.38) where $\alpha_{ik}$ are
quantities determined by formula (VI.36).

The last definition of a vector can readily be extended to a space
with any number of dimensions. Assume that we have an $n$-dimensional
space. Let us take a system of coordinates whose axes are
mutually perpendicular in this space. This signifies that the unit
vectors $\mathbf{e}_1$, $\mathbf{e}_2$, ..., $\mathbf{e}_n$ of the axes satisfy the condition (VI.11).
A vector in $n$-dimensional space (an $n$-vector) is thus defined to be a
set of $n$ quantities $a_1$, $a_2$, ..., $a_n$ that upon a transition from one
coordinate system to another are transformed by formulas (VI.37)
and (VI.38). In summation, the dummy index runs through $n$ values
instead of 3. The number of equations in (VI.37) and (VI.38)
will also be $n$ instead of 3.
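
The transformation formulas (VI.37) and (VI.38) and the orthogonality relation (VI.39) can be illustrated for a simple three-dimensional case. The sketch below is an illustration only. It assumes that the system $K'$ is obtained by turning $K$ through an angle `phi` about the $z$-axis, and all function names are introduced here for the example.

```python
import math


def direction_cosines(phi):
    """alpha[i][k] = e'_i . e_k for K' turned through phi about the z-axis of K.

    In K the primed unit vectors are e'_x = (cos phi, sin phi, 0),
    e'_y = (-sin phi, cos phi, 0), e'_z = (0, 0, 1).
    """
    c, s = math.cos(phi), math.sin(phi)
    return [[c, s, 0.0],
            [-s, c, 0.0],
            [0.0, 0.0, 1.0]]


def to_primed(alpha, a):
    """a'_i = sum_k alpha_ik a_k, formula (VI.37)."""
    return [sum(alpha[i][k] * a[k] for k in range(3)) for i in range(3)]


def to_unprimed(alpha, a_primed):
    """a_i = sum_k alpha_ki a'_k, formula (VI.38)."""
    return [sum(alpha[k][i] * a_primed[k] for k in range(3)) for i in range(3)]


def orthogonality_defect(alpha):
    """Largest deviation of sum_m alpha_im alpha_km from delta_ik, cf. (VI.39)."""
    return max(
        abs(sum(alpha[i][m] * alpha[k][m] for m in range(3))
            - (1.0 if i == k else 0.0))
        for i in range(3) for k in range(3)
    )


alpha = direction_cosines(0.3)
a = [1.0, -2.0, 0.5]
a_primed = to_primed(alpha, a)
a_back = to_unprimed(alpha, a_primed)

if __name__ == "__main__":
    print(a_primed, a_back, orthogonality_defect(alpha))
```

Applying (VI.37) and then (VI.38) returns the original components, and the sum of the squared components is the same in both systems, as expected for an invariant.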

A scalar product of vectors can also be readily generalized for an
$n$-dimensional space. Similar to (VI.27), we shall call the expression

$$ \mathbf{a}\mathbf{b} = \sum_{i=1}^{n} a_i b_i \tag{VI.41} $$

which is an invariant, the scalar product of two vectors having the
components $a_1$, $a_2$, ..., $a_n$ and $b_1$, $b_2$, ..., $b_n$. Vectors whose scalar
product is zero are said to be mutually orthogonal (or mutually
perpendicular).

The concept of a vector product cannot be extended to spaces with 
other than three dimensions. 

Inversion of the coordinate axes (see Fig. VI.3) can be treated as
a transformation from the system $K$ to the system $K'$ whose
coefficients have the values

$$ \alpha_{ik} = \begin{cases} -1 & \text{when } i = k \\ 0 & \text{when } i \neq k \end{cases} \quad \text{or} \quad \alpha_{ik} = -\delta_{ik} \tag{VI.42} $$

By formula (VI.37), the components of a vector upon inversion
are transformed according to the law

$$ a'_i = -\sum_k \delta_{ik} a_k = -a_i $$

i.e. reverse their sign (this was already mentioned on an earlier
page).

Let us find the law of transformation of the components of a vector
product upon inversion of the coordinate axes. We write expression
(VI.33) for the system of coordinates $K'$ (obtained as a result
of inversion of the axes of the system $K$):

$$ [\mathbf{a}\mathbf{b}]'_i = \sum_{k,l} \varepsilon'_{ikl} a'_k b'_l = \sum_{k,l} \varepsilon_{ikl} a'_k b'_l \tag{VI.43} $$

We have taken advantage of the circumstance that the quantities
$\varepsilon_{ikl}$ are determined in the same way for all coordinate systems,
owing to which upon any transformations of the coordinates, we
have

$$ \varepsilon'_{ikl} = \varepsilon_{ikl} \tag{VI.44} $$

Let us express $a'_k$ and $b'_l$ in formula (VI.43) in terms of the unprimed
components of the relevant vectors using relation (VI.37):

$$ [\mathbf{a}\mathbf{b}]'_i = \sum_{k,l} \varepsilon_{ikl} \sum_m (-\delta_{km}) a_m \sum_p (-\delta_{lp}) b_p = \sum_{k,l,m,p} \varepsilon_{ikl} \delta_{km} \delta_{lp} a_m b_p $$

Summation over the subscripts $m$ and $p$ yields

$$ [\mathbf{a}\mathbf{b}]'_i = \sum_{k,l} \varepsilon_{ikl} a_k b_l $$

In accordance with (VI.33), the last expression is $[\mathbf{a}\mathbf{b}]_i$. We have
thus established that

$$ [\mathbf{a}\mathbf{b}]'_i = [\mathbf{a}\mathbf{b}]_i $$

i.e. that the components of a vector product do not change in inversion.
A vector product of true vectors is therefore a pseudovector.

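Let us write a scalar triple product of three vectors. By formulas
(VI.27) and (VI.33), we have

$$ \mathbf{a}[\mathbf{b}\mathbf{c}] = \sum_i a_i [\mathbf{b}\mathbf{c}]_i = \sum_i a_i \sum_{k,l} \varepsilon_{ikl} b_k c_l = \sum_{i,k,l} \varepsilon_{ikl} a_i b_k c_l \tag{VI.45} $$

Let us see how this quantity behaves in inversion. In the system $K'$,
we have

$$ (\mathbf{a}[\mathbf{b}\mathbf{c}])' = \sum_{i,k,l} \varepsilon_{ikl} a'_i b'_k c'_l = \sum_{i,k,l} \varepsilon_{ikl} \sum_m (-\delta_{im}) a_m \sum_p (-\delta_{kp}) b_p \sum_s (-\delta_{ls}) c_s $$

$$ = -\sum_{i,k,l} \varepsilon_{ikl} a_i b_k c_l = -(\mathbf{a}[\mathbf{b}\mathbf{c}]) $$

We have obtained a result we already know: the scalar triple product
of true vectors upon inversion changes its sign, i.e. is a pseudoscalar.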

5. Increment of a Vector in Rotation. Let us find the increment
which the vector $\mathbf{a}$ obtains in rotation through an infinitely small
angle $d\varphi$. We introduce two coordinate systems $K$ and $K'$ which we
choose so that their axes $z$ and $z'$ coincide with the vector $d\boldsymbol{\varphi}$
(Fig. VI.4). Assume that the system $K'$ turns together with the
vector $\mathbf{a}$ through the angle $d\varphi$ relative to the system $K$. Relative to
the system $K'$, the vector $\mathbf{a}$ remains constant, while relative to the
system $K$ it receives the increment $d\mathbf{a}$.

We shall first assume that the tail of the vector $\mathbf{a}$ is on the $z$-axis
(Fig. VI.4). If $\mathbf{a}$ is initially in the plane $yz$, the increment $d\mathbf{a}$ is
collinear to the $x$-axis. The magnitude of this increment, as can be seen
from the figure, is $a \sin \alpha \, d\varphi$ (here $\alpha$ is the angle between $\mathbf{a}$ and
the $z$-axis). It follows from the above that the
increment of the vector $\mathbf{a}$ can be written as

$$ d\mathbf{a} = [d\boldsymbol{\varphi}, \mathbf{a}] \tag{VI.46} $$

We shall prove that the formula we have found also holds with an
arbitrary arrangement of the vector $\mathbf{a}$ relative to the coordinate
systems $K$ and $K'$. Let us introduce the unit vectors $\mathbf{e}_x$, $\mathbf{e}_y$, $\mathbf{e}_z$ of the
system $K$ and the unit vectors $\mathbf{e}'_x$, $\mathbf{e}'_y$, $\mathbf{e}'_z$ of the system $K'$. The vector
$\mathbf{a}$ can therefore be set by the expression

$$ \mathbf{a} = \mathbf{e}_x a_x + \mathbf{e}_y a_y + \mathbf{e}_z a_z \tag{VI.47} $$

or the expression

$$ \mathbf{a} = \mathbf{e}'_x a'_x + \mathbf{e}'_y a'_y + \mathbf{e}'_z a'_z \tag{VI.48} $$

where $a_x$, $a_y$, $a_z$ are the projections of the vector $\mathbf{a}$ onto the axes of
the system $K$, and $a'_x$, $a'_y$, $a'_z$ are the projections of $\mathbf{a}$ onto the
axes of the system $K'$.

When the vector turns together with the system $K'$ through the
angle $d\varphi$, it receives an increment relative to $K$ that can be
written as the increment of expression (VI.47):

$$ d\mathbf{a} = \mathbf{e}_x \, da_x + \mathbf{e}_y \, da_y + \mathbf{e}_z \, da_z $$

or as an increment of expression (VI.48):

$$ d\mathbf{a} = a'_x \, d\mathbf{e}'_x + a'_y \, d\mathbf{e}'_y + a'_z \, d\mathbf{e}'_z \tag{VI.49} $$

where $d\mathbf{e}'_x$, $d\mathbf{e}'_y$, $d\mathbf{e}'_z$ are the increments of the unit vectors of the
system $K'$ observed in the system $K$ (remember that the projections
$a'_x$, $a'_y$, $a'_z$ remain unchanged upon rotation).

With the direction of the $z'$-axis we have chosen, the increment of
the unit vector $\mathbf{e}'_z$ vanishes ($d\mathbf{e}'_z = 0$). Figure VI.5 shows the
increments $d\mathbf{e}'_x$ and $d\mathbf{e}'_y$ received by the unit vectors $\mathbf{e}'_x$ and $\mathbf{e}'_y$ when the
coordinate system $K'$ turns through the angle $d\varphi$. Examination of
the figure shows that the direction of $d\mathbf{e}'_x$ coincides with the
direction of the unit vector $\mathbf{e}'_y$. The magnitude of $d\mathbf{e}'_x$, on the other hand,
equals $d\varphi$ (the magnitude, i.e. the length, of any unit vector is
unity). Consequently, the increment of the unit vector $\mathbf{e}'_x$ observed in
the system $K$ can be written as

$$ d\mathbf{e}'_x = \mathbf{e}'_y \, d\varphi $$

Similar reasoning results in the formula

$$ d\mathbf{e}'_y = -\mathbf{e}'_x \, d\varphi $$

(the minus sign is due to the fact that the vectors $d\mathbf{e}'_y$ and $\mathbf{e}'_x$ are
directed oppositely).

[Fig. VI.5]

Introducing the values of the unit vector increments we have found
into formula (VI.49), we obtain

$$ d\mathbf{a} = (a'_x \mathbf{e}'_y - a'_y \mathbf{e}'_x) \, d\varphi $$

We shall show that the expression we have found is equivalent to the
vector product $[d\boldsymbol{\varphi}, \mathbf{a}]$. To do this, we shall express the product in
terms of the projections of the vectors being multiplied onto the axes
of the frame $K'$, taking into account that $d\boldsymbol{\varphi}$ is directed along the
$z'$-axis. In accordance with formula (VI.32), we have

$$ [d\boldsymbol{\varphi}, \mathbf{a}] = \begin{vmatrix} \mathbf{e}'_x & \mathbf{e}'_y & \mathbf{e}'_z \\ 0 & 0 & d\varphi \\ a'_x & a'_y & a'_z \end{vmatrix} = (a'_x \mathbf{e}'_y - a'_y \mathbf{e}'_x) \, d\varphi $$

We have thus arrived at formula (VI.46).
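
Formula (VI.46) is a first-order statement, so it can be compared with an exact rotation through a small but finite angle. The sketch below is an illustration only; the helper names, the sample vector and the angle $10^{-6}$ are assumptions. It turns a vector about the $z$-axis and compares the actual increment with $[d\boldsymbol{\varphi}, \mathbf{a}]$.

```python
import math


def rotate_about_z(a, phi):
    """Exact result of turning the vector a through the angle phi about the z-axis."""
    c, s = math.cos(phi), math.sin(phi)
    return (a[0] * c - a[1] * s, a[0] * s + a[1] * c, a[2])


def cross(a, b):
    """Vector product in components, formula (VI.30)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


a = (1.0, 2.0, 3.0)
dphi = 1e-6

# Exact increment of a for a small finite angle.
rotated = rotate_about_z(a, dphi)
da_exact = tuple(r - x for r, x in zip(rotated, a))

# Increment predicted by (VI.46); the vector d(phi) points along the z-axis.
da_formula = cross((0.0, 0.0, dphi), a)

# The two differ only by terms of the second order in dphi.
max_difference = max(abs(x - y) for x, y in zip(da_exact, da_formula))

if __name__ == "__main__":
    print(da_exact, da_formula, max_difference)
```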


VII. Matrices 

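Definition. In Appendix VI, we obtained formulas for the
transformation of vector components in a transition from the coordinate
system $K$ to the system $K'$:

$$ a'_i = \sum_k \alpha_{ik} a_k \quad (i = 1, 2, 3) \tag{VII.1} $$

$$ a_i = \sum_k \alpha_{ki} a'_k \quad (i = 1, 2, 3) \tag{VII.2} $$

[see formulas (VI.37) and (VI.38)].

The transition coefficients can be written in the form of a square
table

$$ A = \begin{pmatrix} \alpha_{11} & \alpha_{12} & \alpha_{13} \\ \alpha_{21} & \alpha_{22} & \alpha_{23} \\ \alpha_{31} & \alpha_{32} & \alpha_{33} \end{pmatrix} \tag{VII.3} $$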

known as a transformation matrix. The quantities $\alpha_{ik}$ are the matrix
elements. The first subscript indicates the number of the row in
which the given element is, and the second subscript the number
of the column.

Let us come to an agreement about our notation. We shall denote
the elements of a matrix by lower-case letters with two subscripts,
and a matrix by the corresponding capital letter (for instance, the
element $\alpha_{ik}$ of the matrix $A$¹). We shall denote the components of a
vector by lower-case italic letters with one subscript, and the vector
itself by the same lower-case Roman (upright) letter in bold-face
type (for instance, $a_i$ is the component of a vector, and $\mathbf{a}$ is the
vector itself).

The operation (VII.1) of transforming the components of a vector
can be written symbolically as the multiplication of a vector and a
matrix:

$$ \mathbf{a}' = A\mathbf{a} \tag{VII.4} $$

The coefficients of the inverse transformation (VII.2) form the
matrix

$$ A^{-1} = \begin{pmatrix} \alpha_{11} & \alpha_{21} & \alpha_{31} \\ \alpha_{12} & \alpha_{22} & \alpha_{32} \\ \alpha_{13} & \alpha_{23} & \alpha_{33} \end{pmatrix} \tag{VII.5} $$

called an inverse matrix. Denoting the elements of an inverse matrix
by the symbol $\alpha^{-1}_{ik}$, we can write

$$ \alpha^{-1}_{ik} = \alpha_{ki} \tag{VII.6} $$

The matrix obtained from $A$ by interchanging the rows with the
columns is called a transposed matrix and is designated by $\tilde{A}$. If we
denote the elements of a transposed matrix by the symbol $\tilde{\alpha}_{ik}$, we
can write

$$ \tilde{\alpha}_{ik} = \alpha_{ki} \tag{VII.7} $$

A glance at formulas (VII.6) and (VII.7) shows that the matrix
(VII.5) of the inverse transformation coincides with the transposed
matrix (VII.3) of the direct transformation:

$$ A^{-1} = \tilde{A} \tag{VII.8} $$

Relation (VII.8) does not hold for every matrix². Matrices
satisfying the condition (VII.8) are called orthogonal.

The inverse transformation (VII.2) is written symbolically as

$$ \mathbf{a} = A^{-1}\mathbf{a}' \tag{VII.9} $$

¹ This is how the capital Greek letter "alpha" is written.

² In general, not every matrix has an inverse. A matrix for which no
inverse exists is known as a singular or degenerate one. But even if a matrix
is non-singular, its inverse and transposed matrices may not coincide.

Without altering the formal mathematical aspect of the matter,
relations (VII.4) and (VII.9) [in other words, relations (VII.1) and
(VII.2)] can be treated not as operations of transition from one
coordinate system to another, but as operations transforming one vector
into another, both vectors being considered in the same coordinate
system. With such an interpretation in view, we can write the
formulas for transformation as follows:

$$ \mathbf{b} = A\mathbf{a} \tag{VII.10} $$

$$ \mathbf{a} = A^{-1}\mathbf{b} \tag{VII.11} $$

Hence, the matrix $A$ can be considered as a linear operator that by
acting on the vector $\mathbf{a}$ transforms it into the vector $\mathbf{b}$.
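
The operator interpretation and the orthogonality condition (VII.8) can be illustrated with plain nested lists. The sketch below is an illustration only: the function names, the sample matrices and the tolerance are assumptions. It applies an orthogonal matrix to a vector as in (VII.10), restores the vector with the transposed matrix as in (VII.11), and shows a non-singular matrix that is not orthogonal.

```python
def transpose(A):
    """Interchange the rows with the columns, cf. (VII.7)."""
    return [list(column) for column in zip(*A)]


def matvec(A, v):
    """b_i = sum_k alpha_ik a_k, the explicit form of b = A a."""
    return [sum(row[k] * v[k] for k in range(len(v))) for row in A]


def matmul(A, B):
    """Row-by-column product of two matrices."""
    return [[sum(A[i][m] * B[m][k] for m in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]


def is_orthogonal(A, tol=1e-12):
    """True if A times its transposed matrix is the unit matrix, i.e. A^-1 equals A transposed."""
    n = len(A)
    P = matmul(A, transpose(A))
    return all(abs(P[i][k] - (1.0 if i == k else 0.0)) <= tol
               for i in range(n) for k in range(n))


# A rotation about the third axis with cos = 0.6 and sin = 0.8.
A = [[0.6, 0.8, 0.0],
     [-0.8, 0.6, 0.0],
     [0.0, 0.0, 1.0]]
a = [1.0, 2.0, 3.0]

b = matvec(A, a)                   # b = A a, relation (VII.10)
a_back = matvec(transpose(A), b)   # a = A^-1 b with A^-1 taken as the transposed matrix

# A non-singular matrix whose inverse and transposed matrices differ.
S = [[1.0, 1.0],
     [0.0, 1.0]]

if __name__ == "__main__":
    print(b, a_back, is_orthogonal(A), is_orthogonal(S))
```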

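Let us write the transformations (VII.10) and (VII.11) in the
explicit form, and for greater generality we shall consider that the
vectors $\mathbf{a}$ and $\mathbf{b}$ are determined not in a three-dimensional space, but
in a space with $n$ dimensions. By analogy with (VII.1) and (VII.2),
we obtain

$$ b_i = \sum_{k=1}^{n} \alpha_{ik} a_k \quad (i = 1, 2, \ldots, n) \tag{VII.12} $$

$$ a_i = \sum_{k=1}^{n} \alpha^{-1}_{ik} b_k \quad (i = 1, 2, \ldots, n) \tag{VII.13} $$

where $\alpha^{-1}_{ik}$ are the elements of the inverse transformation matrix
(the matrix $A^{-1}$). For an orthogonal matrix, $\alpha^{-1}_{ik} = \alpha_{ki}$.

The matrices $A$ and $A^{-1}$ will now have $n$ rows and $n$ columns, for
example

$$ A = \begin{pmatrix} \alpha_{11} & \alpha_{12} & \ldots & \alpha_{1n} \\ \alpha_{21} & \alpha_{22} & \ldots & \alpha_{2n} \\ \ldots & \ldots & \ldots & \ldots \\ \alpha_{n1} & \alpha_{n2} & \ldots & \alpha_{nn} \end{pmatrix} \tag{VII.14} $$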

The matrix (VII. 14) is square — the number of rows in it equals 
the number of columns. In addition to square matrices, rectangular 
ones are -also considered, in which the number of rows m does not 
equal the number of columns n: 




“h 

“l2 

« • • “in 


A — A(mi n) — 

“21 

“22 

... “2n (VII. 15) 



“ml 

“m2 

• • • “mn 


The first subscript on the matrix symbol indicates the number of 
rows, and the second — the number of columns. We shall drop these 
subscripts when this does not cause ambiguity. 

Hence, in the general case, a matrix is defined to be a set of $m \cdot n$
elements arranged in the form of a rectangular array. The elements
of a matrix may be functions, numbers, or other quantities on which
algebraic operations can be performed. A matrix with m rows and n
columns is said to be an m by n matrix (written $m \times n$). An $m \times 1$
matrix, i.e. a matrix with one column, is called a column matrix,
and a $1 \times n$ matrix, i.e. a matrix with one row, is called a row
matrix.

Two matrices A and B are said to be equal ($A = B$) if the corresponding
elements of these matrices equal one another ($\alpha_{ik} = \beta_{ik}$).

The matrices A and B are considered to differ only in their sign
($A = -B$) if the corresponding elements of these matrices are related by
the expression $\alpha_{ik} = -\beta_{ik}$.

The square matrix (VII.14) (i.e. an $n \times n$ matrix) is a particular
case of the matrix (VII.15). A matrix transforming a vector in an
n-dimensional space into another vector in the same space will
evidently be square.

If the elements of a square matrix satisfy the condition

$$ \alpha_{ik} = \alpha_{ki} $$ (VII.16)

the matrix is symmetric. A symmetric matrix obviously coincides
with its transposed one:

$$ \tilde{A}_{sym} = A_{sym} $$ (VII.17)

A square matrix whose elements satisfy the condition

$$ \alpha_{ik} = -\alpha_{ki} $$ (VII.18)

is said to be antisymmetric or skew-symmetric. An antisymmetric matrix
differs from its transposed one only in the sign:

$$ \tilde{A}_{asym} = -A_{asym} $$ (VII.19)

A square matrix in which only the elements with identical
values of the subscripts i and k are non-zero is called diagonal. Such
a matrix has the form

$$ \Lambda = \begin{Vmatrix} \lambda_1 & 0 & \dots & 0 \\ 0 & \lambda_2 & \dots & 0 \\ \dots & \dots & \dots & \dots \\ 0 & 0 & \dots & \lambda_n \end{Vmatrix} $$ (VII.20)

The elements of this matrix can be written as follows:

$$ \lambda_{ik} = \lambda_k \delta_{ik} $$ (VII.21)

where $\delta_{ik}$ is the Kronecker symbol [see (VI.12)].

If we change the coordinate system (i.e. the basis $\mathbf{e}_1, \mathbf{e}_2, \dots, \mathbf{e}_n$),
the components of the vectors a and b [see formula (VII.10)] become
different. The elements of the matrix operator will also change.
Sometimes (in particular, when the matrix A is symmetric), the basis
can be chosen so that the matrix A becomes diagonal.

In a transition from one coordinate system to another, the
elements of a matrix change, but the sum of the diagonal elements, called
the trace of the matrix (its symbol is Tr A), remains unchanged.
Therefore, the trace of a matrix is identical in all coordinate systems,
i.e. is an invariant:

$$ \operatorname{Tr} A = \sum_i \alpha_{ii} = \text{inv} $$ (VII.22)

The determinant of the matrix [see (VIII.3)] also remains unchanged:

$$ \det\|\alpha_{ik}\| = \text{inv} $$ (VII.23)
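
The invariance stated in (VII.22) and (VII.23) can be checked numerically. The Python sketch below assumes (the text does not state the law at this point) that under an orthogonal change of basis R the operator matrix becomes $\tilde{R}AR$; the helper names and the sample numbers are illustrative only.

```python
import math

def matmul(b, a):
    """Matrix product BA: element (i, k) is the sum over m of b[i][m] * a[m][k]."""
    return [[sum(b[i][m] * a[m][k] for m in range(len(a)))
             for k in range(len(a[0]))] for i in range(len(b))]

def transpose(a):
    """Interchange rows and columns."""
    return [list(col) for col in zip(*a)]

def trace(a):
    """Sum of the diagonal elements."""
    return sum(a[i][i] for i in range(len(a)))

def det2(a):
    """Determinant of a 2 x 2 matrix."""
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

phi = 0.7  # arbitrary rotation angle (illustrative value)
R = [[math.cos(phi), -math.sin(phi)],
     [math.sin(phi),  math.cos(phi)]]
A = [[2.0, 1.0],
     [0.0, 3.0]]

# Assumed transformation law for the operator matrix in the rotated basis.
A_new = matmul(transpose(R), matmul(A, R))
```

The individual elements of `A_new` differ from those of `A`, while `trace` and `det2` agree for both matrices to within rounding error.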

Let us define a unit (or identity) matrix $I$ which when multiplied
by a vector according to the rule (VII.10) yields the same vector:

$$ \mathbf{a} = I\mathbf{a} $$

It is a simple matter to see that the elements of a unit matrix must
equal $\delta_{ik}$ [the substitution of $\alpha_{ik} = \delta_{ik}$ into (VII.12) leads to the
relation $b_i = a_i$]. Hence,

$$ I = \|\delta_{ik}\| = \begin{Vmatrix} 1 & 0 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ \dots & \dots & \dots & \dots \\ 0 & 0 & \dots & 1 \end{Vmatrix} $$ (VII.24)

We must note that this matrix is diagonal.

Matrix Algebra. Matrices are algebraic objects lending themselves
to addition, subtraction, and multiplication (the operation of matrix
division does not exist).

The sum of two matrices A and B is the matrix $\Gamma = A + B$ whose
elements are determined by the formula

$$ \gamma_{ik} = \alpha_{ik} + \beta_{ik} $$ (VII.25)

The difference of two matrices is the matrix $\Gamma = A - B$ with the
elements

$$ \gamma_{ik} = \alpha_{ik} - \beta_{ik} $$ (VII.26)

It is evident that only matrices having the same number of rows
and the same number of columns can be added and subtracted.

The product of the matrix A and the scalar $\eta$ is defined to be the
matrix $B = \eta A$ with the elements

$$ \beta_{ik} = \eta\alpha_{ik} $$ (VII.27)

Let us now consider the multiplication of matrices. Assume that
the action of the matrix A on the vector a transforms it into the
vector b, and the action of the matrix B on the vector b transforms
it into the vector c. It is natural to define the product of the matrices
A and B as the matrix $\Gamma$ which when acting on the vector a
transforms it into the vector c. Hence,

$$ \mathbf{b} = A\mathbf{a}, \quad \text{i.e.} \quad b_m = \sum_k \alpha_{mk} a_k $$

$$ \mathbf{c} = B\mathbf{b} = BA\mathbf{a}, \quad \text{i.e.} \quad c_i = \sum_m \beta_{im} b_m = \sum_m \beta_{im} \sum_k \alpha_{mk} a_k = \sum_k \sum_m \beta_{im}\alpha_{mk} a_k $$

On the other hand,

$$ \mathbf{c} = \Gamma\mathbf{a}, \quad \text{i.e.} \quad c_i = \sum_k \gamma_{ik} a_k $$

A comparison of the two formulas for c and $c_i$ leads to the rule of
matrix multiplication:

$$ \Gamma = BA \quad \text{signifies that} \quad \gamma_{ik} = \sum_m \beta_{im}\alpha_{mk} $$ (VII.28)

According to this rule, to obtain the element of the matrix $\Gamma$ at the
intersection of the i-th row and the k-th column, we must multiply
each element of the i-th row of the matrix B by the corresponding
element of the k-th column of the matrix A and sum all the
products. This can be explained by the following diagram:

[Diagram (VII.29): the i-th row of B is combined with the k-th column of A to give the element $\gamma_{ik}$ of $\Gamma$.]


We must note that matrix multiplication, in general, is not
commutative, i.e.

$$ BA \neq AB $$

Matrices for which the condition

$$ BA = AB $$ (VII.30)

is satisfied are said to commute.
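
Rule (VII.28) and the lack of commutativity are easy to try out on small matrices. The following Python sketch uses plain nested lists; the function names `matmul` and `apply` and the sample matrices are illustrative assumptions, not notation from the text.

```python
def matmul(b, a):
    """Product BA by rule (VII.28): gamma[i][k] = sum over m of b[i][m] * a[m][k]."""
    return [[sum(b[i][m] * a[m][k] for m in range(len(a)))
             for k in range(len(a[0]))] for i in range(len(b))]

def apply(matrix, vector):
    """Act with a matrix on a vector: result[i] = sum over k of matrix[i][k] * vector[k]."""
    return [sum(row[k] * vector[k] for k in range(len(vector))) for row in matrix]

A = [[0, 0],
     [1, 0]]
B = [[0, 1],
     [0, 0]]
a = [3, 4]

BA = matmul(B, A)  # A acts on the vector first, then B
AB = matmul(A, B)  # B acts on the vector first, then A
```

For these two matrices `BA` and `AB` differ, while acting with `A` and then `B` on `a` gives the same vector as acting with `BA` once.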

It is easy to show that the product of matrices is associative:

$$ (\Gamma B) A = \Gamma (BA) $$ (VII.31)

This signifies that by first multiplying B and $\Gamma$, and then A and
($\Gamma B$), we obtain the same result as we would by first multiplying the
matrices A and B and then multiplying the matrix (BA) and $\Gamma$.
Indeed, according to the rule of matrix multiplication

$$ \{(\Gamma B)A\}_{ik} = \sum_m (\Gamma B)_{im}\alpha_{mk} = \sum_m \Big(\sum_l \gamma_{il}\beta_{lm}\Big)\alpha_{mk} = \sum_l \gamma_{il}\Big(\sum_m \beta_{lm}\alpha_{mk}\Big) = \sum_l \gamma_{il}(BA)_{lk} = \{\Gamma(BA)\}_{ik} $$

(in the course of the transformations we have changed the sequence of
summation over the subscripts m and l). Consequently, the property
(VII.31) has been proved.
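
Associativity (VII.31) can be spot-checked on integer matrices, including rectangular ones. In this Python sketch the shape test inside `matmul` and the sample matrices `G`, `B`, `A` are illustrative assumptions; `G` stands for the matrix written $\Gamma$ above.

```python
def matmul(b, a):
    """Product BA; the number of columns of b must equal the number of rows of a."""
    if len(b[0]) != len(a):
        raise ValueError("columns of the left factor must match rows of the right factor")
    return [[sum(b[i][m] * a[m][k] for m in range(len(a)))
             for k in range(len(a[0]))] for i in range(len(b))]

G = [[1, 2],
     [3, 4]]      # 2 x 2
B = [[1, 0, 2],
     [0, 1, 1]]   # 2 x 3
A = [[1, 1],
     [0, 2],
     [3, 0]]      # 3 x 2

left = matmul(matmul(G, B), A)    # (GB)A
right = matmul(G, matmul(B, A))   # G(BA)
```

Both groupings give the same 2 x 2 matrix, whereas an attempt such as `matmul(A, A)` is rejected because the shapes do not fit.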

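Non-square (rectangular) matrices can also be multiplied by each
other. Examination of the diagram (VII.29) shows that such matrices
can be multiplied only when the number of columns of the matrix B
(the second matrix¹) coincides with the number of rows of the matrix
A (the first matrix). The product matrix will have the same number
of rows as the second matrix (B) does, and the same number of
columns as the first matrix (A) does. We shall explain this by the
following example:

$$ \begin{Vmatrix} \beta_{11} & \beta_{12} & \dots & \beta_{1n} \\ \beta_{21} & \beta_{22} & \dots & \beta_{2n} \end{Vmatrix} \begin{Vmatrix} \alpha_{11} & \alpha_{12} & \alpha_{13} \\ \alpha_{21} & \alpha_{22} & \alpha_{23} \\ \dots & \dots & \dots \\ \alpha_{n1} & \alpha_{n2} & \alpha_{n3} \end{Vmatrix} = \begin{Vmatrix} \sum_k \beta_{1k}\alpha_{k1} & \sum_k \beta_{1k}\alpha_{k2} & \sum_k \beta_{1k}\alpha_{k3} \\ \sum_k \beta_{2k}\alpha_{k1} & \sum_k \beta_{2k}\alpha_{k2} & \sum_k \beta_{2k}\alpha_{k3} \end{Vmatrix} $$

If the second matrix is square, i.e. is an $n \times n$ matrix, and the
first matrix has only one column with n elements, the product
matrix also consists of one column with n elements:

$$ \begin{Vmatrix} \beta_{11} & \beta_{12} & \dots & \beta_{1n} \\ \beta_{21} & \beta_{22} & \dots & \beta_{2n} \\ \dots & \dots & \dots & \dots \\ \beta_{n1} & \beta_{n2} & \dots & \beta_{nn} \end{Vmatrix} \begin{Vmatrix} \alpha_1 \\ \alpha_2 \\ \vdots \\ \alpha_n \end{Vmatrix} = \begin{Vmatrix} \sum_k \beta_{1k}\alpha_k \\ \sum_k \beta_{2k}\alpha_k \\ \vdots \\ \sum_k \beta_{nk}\alpha_k \end{Vmatrix} $$ (VII.32)

When a column matrix is multiplied by a row matrix, the result
is simply a number (or a function if the matrix elements are
functions):

$$ \|\beta_1 \ \beta_2 \ \dots \ \beta_n\| \begin{Vmatrix} \alpha_1 \\ \alpha_2 \\ \vdots \\ \alpha_n \end{Vmatrix} = \sum_k \beta_k\alpha_k $$ (VII.33)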

1 In the matrix product BA, the matrix A at the right should be considered 
as the first multiplier. It is multiplied by a vector first of all, and only then the 
second matrix B acts on the result. 
In particular, if the row matrix $\|\beta\|$ is taken to be the transpose of
the column matrix $\|\alpha\|$, Eq. (VII.33) becomes

$$ \|\alpha_1 \ \alpha_2 \ \dots \ \alpha_n\| \begin{Vmatrix} \alpha_1 \\ \alpha_2 \\ \vdots \\ \alpha_n \end{Vmatrix} = \sum_k \alpha_k^2 $$

Consequently, for the column matrix $A_{(n,1)}$, the following relation
holds:

$$ \tilde{A}_{(n,1)} A_{(n,1)} = \sum_{k=1}^{n} \alpha_k^2 $$ (VII.34)


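If we take the components of the vector a as the elements of a
column matrix, and the matrix operator A as a square matrix,
relation (VII.32) will become

$$ \begin{Vmatrix} \alpha_{11} & \alpha_{12} & \dots & \alpha_{1n} \\ \alpha_{21} & \alpha_{22} & \dots & \alpha_{2n} \\ \dots & \dots & \dots & \dots \\ \alpha_{n1} & \alpha_{n2} & \dots & \alpha_{nn} \end{Vmatrix} \begin{Vmatrix} a_1 \\ a_2 \\ \vdots \\ a_n \end{Vmatrix} = \begin{Vmatrix} b_1 \\ b_2 \\ \vdots \\ b_n \end{Vmatrix} $$ (VII.35)

where $b_i = \sum_k \alpha_{ik} a_k$ [compare with (VII.12)]. It is not at all difficult
to see that relation (VII.35) is equivalent to relation (VII.10).
Consequently, a vector can be represented as a column matrix.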

Consider the product of the unit matrix I and an arbitrary matrix
A. By rule (VII.28)

$$ (AI)_{ik} = \sum_m \alpha_{im}\delta_{mk} $$

($\delta_{mk}$ are the elements of the matrix I). In this sum, only the one term
in which m = k will be non-zero. Therefore, $(AI)_{ik} = \alpha_{ik}$.
Similarly,

$$ (IA)_{ik} = \sum_m \delta_{im}\alpha_{mk} = \alpha_{ik} $$

It follows from the above that multiplication by a unit matrix
(with any sequence of the factors) does not change the matrix A:

$$ IA = AI = A $$ (VII.36)

Relation (VII.36) signifies that a unit matrix commutes with any
matrix A.

It is obvious that by first applying to a vector the transformation
A, and then its inverse transformation $A^{-1}$, we must return to the
initial vector:

$$ \mathbf{a} = A^{-1}A\mathbf{a} $$ (VII.37)

This shows that the product of a matrix and its inverse must
equal a unit matrix: $A^{-1}A = I$ ¹. The product of a matrix and its
inverse is obviously commutative, hence

$$ A^{-1}A = AA^{-1} = I $$ (VII.38)

Having written the elements of the product of the matrices A and
$A^{-1}$ by formula (VII.28), we can find the relation between the
elements of the direct and the inverse matrices:

$$ \sum_m \alpha'_{im}\alpha_{mk} = \sum_m \alpha_{im}\alpha'_{mk} = \delta_{ik} $$ (VII.39)

For an orthogonal matrix, i.e. one satisfying the condition (VII.8),
we have $\alpha'_{ik} = \alpha_{ki}$ [see (VII.6)]. Making this substitution in (VII.39),
we obtain

$$ \sum_m \alpha_{mi}\alpha_{mk} = \delta_{ik} $$ (VII.40)

$$ \sum_m \alpha_{im}\alpha_{km} = \delta_{ik} $$ (VII.41)

The elements of an orthogonal matrix thus satisfy relations (VII.40)
and (VII.41) [compare with formulas (VI.39) and (VI.40)].
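
The two orthogonality relations can be tested directly on the elements of a matrix. In the Python sketch below the function names, the tolerance and the two sample matrices (a rotation about the third axis and a shear) are illustrative assumptions.

```python
import math

def delta(i, k):
    """Kronecker symbol."""
    return 1.0 if i == k else 0.0

def columns_orthonormal(a, tol=1e-12):
    """Check sum over m of a[m][i] * a[m][k] == delta_ik, i.e. relation (VII.40)."""
    n = len(a)
    return all(abs(sum(a[m][i] * a[m][k] for m in range(n)) - delta(i, k)) <= tol
               for i in range(n) for k in range(n))

def rows_orthonormal(a, tol=1e-12):
    """Check sum over m of a[i][m] * a[k][m] == delta_ik, i.e. relation (VII.41)."""
    n = len(a)
    return all(abs(sum(a[i][m] * a[k][m] for m in range(n)) - delta(i, k)) <= tol
               for i in range(n) for k in range(n))

phi = 0.3  # arbitrary angle (illustrative value)
rotation = [[math.cos(phi),  math.sin(phi), 0.0],
            [-math.sin(phi), math.cos(phi), 0.0],
            [0.0,            0.0,           1.0]]
shear = [[1.0, 1.0, 0.0],
         [0.0, 1.0, 0.0],
         [0.0, 0.0, 1.0]]  # non-singular, yet not orthogonal
```

The rotation passes both checks; the shear, although it has an inverse, fails them, in line with the remark that the inverse and the transposed matrices need not coincide.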


VIII. Determinants 

Assume that we have the square matrix

$$ A = \begin{Vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{Vmatrix} $$ (VIII.1)

Let us form the following expression from the elements of this
matrix:

$$ \varepsilon_{ik\dots m}\, a_{1i} a_{2k} \dots a_{nm} $$ (VIII.2)

where i, k, ..., m is a permutation of the numbers 1, 2, ..., n, and
$\varepsilon_{ik\dots m}$ is a quantity equal to +1 if the number of disorders
(inversions)² in the permutation i, k, ..., m is even and to −1 if it is
odd [compare with (VI.15)]. The number of permutations of n
numbers taken n at a time is known to be n!. We can therefore compile n!
different expressions of the form given by (VIII.2).


1 This relation clarifies the symbol $A^{-1}$ used for an inverse matrix (I is
“unity”).

2 Recall that a disorder in a permutation is the fact that a larger number 
precedes a smaller one (see p. 320). 
The sum of all the expressions having the form of (VIII.2) is
denoted by the symbols

$$ D_n = D(A) = \det\|a_{ik}\| = \begin{vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{vmatrix} = \sum_{(ik\dots m)} \varepsilon_{ik\dots m}\, a_{1i} a_{2k} \dots a_{nm} $$ (VIII.3)

and is called the determinant corresponding to the matrix (VIII.1).
The sum (VIII.3) is taken over all the permutations of the numbers
i, k, ..., m. Consequently, it contains n! terms.

If we add to the definition of $\varepsilon_{ik\dots m}$ the condition that this
quantity vanishes when the values of at least two of the n subscripts
coincide, the determinant D(A) can be evaluated as the sum

$$ D(A) = \sum_{i,k,\dots,m=1}^{n} \varepsilon_{ik\dots m}\, a_{1i} a_{2k} \dots a_{nm} $$ (VIII.4)

in which all the subscripts i, k, ..., m take on values from 1 to n.

A glance at (VIII. 3) shows that a determinant can be written as 
an array similar to (VIII. 1) with the difference that single vertical 
bars are used instead of double ones. 

The number of rows (or columns) of a determinant is said to be its 
order. 

We must note that the determinant of the diagonal matrix (VII.20)
equals the product of its diagonal elements:

$$ \det\|\lambda_k\delta_{ik}\| = \lambda_1\lambda_2 \dots \lambda_n $$ (VIII.5)

and the determinant of a unit matrix is unity:

$$ D(I) = 1 $$ (VIII.6)

Let us explain what has been said above using a third-order
determinant as an example:

$$ D_3 = \begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix} $$

We can make 3! = 6 permutations from the second subscripts of the
elements of this determinant:

123    0 disorders    +
231    2 disorders    +
312    2 disorders    +
321    3 disorders    −
213    1 disorder     −
132    1 disorder     −

By formula (VIII.4), the expression

$$ D_3 = a_{11}a_{22}a_{33} + a_{12}a_{23}a_{31} + a_{13}a_{21}a_{32} - a_{13}a_{22}a_{31} - a_{12}a_{21}a_{33} - a_{11}a_{23}a_{32} $$

is a determinant of the third order.
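
The definition (VIII.3) translates almost word for word into a short program: run over all permutations, count the disorders, and add up the signed products. The Python sketch below is an illustration with assumed names (`disorders`, `det_by_permutations`) and an arbitrary sample matrix; it uses subscripts counted from 0, which does not affect the number of disorders.

```python
from itertools import permutations

def disorders(perm):
    """Count the pairs in which a larger number precedes a smaller one."""
    return sum(1 for p in range(len(perm)) for q in range(p + 1, len(perm))
               if perm[p] > perm[q])

def det_by_permutations(a):
    """Sum of sign * a[0][i] * a[1][k] * ... over all permutations (i, k, ...)."""
    n = len(a)
    total = 0
    for perm in permutations(range(n)):
        term = 1 if disorders(perm) % 2 == 0 else -1  # the factor epsilon
        for row, col in enumerate(perm):
            term *= a[row][col]
        total += term
    return total

a = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 10]]

# The six-term expression for a third-order determinant, written out explicitly.
six_terms = (a[0][0] * a[1][1] * a[2][2] + a[0][1] * a[1][2] * a[2][0]
             + a[0][2] * a[1][0] * a[2][1] - a[0][2] * a[1][1] * a[2][0]
             - a[0][1] * a[1][0] * a[2][2] - a[0][0] * a[1][2] * a[2][1])
```

For this sample matrix both `det_by_permutations(a)` and `six_terms` equal −3. The number of terms grows as n!, so the approach is only practical for small n.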

If we delete the i-th row and the k-th column in an n-th order
determinant, we obtain a determinant of order n − 1 known as the
minor of the initial determinant corresponding to the element $a_{ik}$.
This minor is customarily denoted by the symbol $\Delta_{ik}$. The quantity

$$ A_{ik} = (-1)^{i+k}\Delta_{ik} $$ (VIII.7)

is called the algebraic cofactor of the element $a_{ik}$.

Properties of Determinants. We shall list the basic properties of 
determinants without proving them. 

1. The value of a determinant does not change if corresponding
rows and columns are interchanged:

$$ \begin{vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{vmatrix} = \begin{vmatrix} a_{11} & a_{21} & \dots & a_{n1} \\ a_{12} & a_{22} & \dots & a_{n2} \\ \dots & \dots & \dots & \dots \\ a_{1n} & a_{2n} & \dots & a_{nn} \end{vmatrix} $$

or, more compactly,

$$ \det\|a_{ik}\| = \det\|a_{ki}\| $$ (VIII.8)

Interchanging of the rows and columns is called transposition.
A determinant in which such a change has been made is said to be
transposed. We can thus say that a transposed determinant equals
the initial one.

Property 1 shows that the determinant of a transposed matrix
equals the determinant of the initial matrix:

$$ D(\tilde{A}) = D(A) $$ (VIII.9)

Indeed, these determinants differ only in the rows and columns
having been interchanged, which does not change the value of the
determinant.

2. The sign of a determinant is changed if any two rows or any 
two columns are interchanged. 

3. If two columns or rows are identical, the determinant is zero 
(this follows from the property 2). 

4. A determinant is a linear form¹ of the elements of a row or a
column:

$$ \det\|a_{ik}\| = \sum_{k=1}^{n} A_{ik}a_{ik} $$ (linear form of the elements of the i-th row) (VIII.10)

or

$$ \det\|a_{ik}\| = \sum_{i=1}^{n} A_{ik}a_{ik} $$ (linear form of the elements of the k-th column) (VIII.11)

the algebraic cofactors (VIII.7) of the relevant elements being the
coefficients $A_{ik}$ of the linear forms (VIII.10) and (VIII.11).

1 The linear form of the variables $x_1, x_2, \dots, x_n$ is defined to be the linear
homogeneous function of these variables, i.e. the expression
$f = a_1x_1 + a_2x_2 + \dots + a_nx_n$.

5. If the elements of one row (or column) are multiplied by the
algebraic cofactors of the elements of another row (or column) and
the products obtained are summed, the sum will be zero (this sum
is a determinant with two identical rows or columns; see
property 3).

Properties 4 and 5 can be combined in the form of the relations

$$ \sum_k A_{ik}a_{mk} = \det\|a_{ik}\| \cdot \delta_{im}, \qquad \sum_i A_{ik}a_{im} = \det\|a_{ik}\| \cdot \delta_{km} $$ (VIII.12)


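Two more properties (6 and 7) follow directly from property 4:

6. If all the elements of a row (or column) contain a common
factor, it can be written before the determinant:

$$ \begin{vmatrix} a_{11} & \dots & \beta a_{1k} & \dots & a_{1n} \\ \dots & \dots & \dots & \dots & \dots \\ \alpha a_{i1} & \dots & \alpha\beta a_{ik} & \dots & \alpha a_{in} \\ \dots & \dots & \dots & \dots & \dots \\ a_{n1} & \dots & \beta a_{nk} & \dots & a_{nn} \end{vmatrix} = \alpha\beta \begin{vmatrix} a_{11} & \dots & a_{1k} & \dots & a_{1n} \\ \dots & \dots & \dots & \dots & \dots \\ a_{i1} & \dots & a_{ik} & \dots & a_{in} \\ \dots & \dots & \dots & \dots & \dots \\ a_{n1} & \dots & a_{nk} & \dots & a_{nn} \end{vmatrix} $$ (VIII.13)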


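7. If the elements of a row (or column) are the sum of two (or
more) terms, the determinant equals the sum of determinants in
which the corresponding terms are the elements of the given row (or
column), for example,

$$ \begin{vmatrix} a_{11} & \dots & a'_{1k} + a''_{1k} & \dots & a_{1n} \\ a_{21} & \dots & a'_{2k} + a''_{2k} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots & \dots \\ a_{n1} & \dots & a'_{nk} + a''_{nk} & \dots & a_{nn} \end{vmatrix} = \begin{vmatrix} a_{11} & \dots & a'_{1k} & \dots & a_{1n} \\ a_{21} & \dots & a'_{2k} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots & \dots \\ a_{n1} & \dots & a'_{nk} & \dots & a_{nn} \end{vmatrix} + \begin{vmatrix} a_{11} & \dots & a''_{1k} & \dots & a_{1n} \\ a_{21} & \dots & a''_{2k} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots & \dots \\ a_{n1} & \dots & a''_{nk} & \dots & a_{nn} \end{vmatrix} $$ (VIII.14)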
8. The value of a determinant is not changed if to the elements of
any row (or column) we add the corresponding elements of another
row (or column) after first multiplying them by the same constant
quantity.

This property follows from properties 7, 6, and 3.

9. The product of two determinants of the same order, $\det\|a_{ik}\|$
and $\det\|b_{ik}\|$, is a determinant of the same order, $\det\|c_{ik}\|$,
whose elements are expressed by the formulas

$$ c_{ik} = \sum_m a_{im}b_{mk} $$ (VIII.15)

A comparison of this formula with formula (VII.28) shows that
determinants are multiplied in the same way as the corresponding
matrices. Consequently, the determinant of a product matrix coincides
with the product of the determinants of the factor matrices.

It can be seen from property 9 that the determinant of an
orthogonal matrix is ±1. Indeed, for an orthogonal matrix $\tilde{A} = A^{-1}$
[see (VII.8)], owing to which $\tilde{A}A = I$. According to what has been
said above,

$$ D(\tilde{A})\,D(A) = D(I) = 1 $$

[see (VIII.6)]. But by (VIII.9), we have $D(\tilde{A}) = D(A)$, so that we
can write

$$ [D(A)]^2 = 1 $$

whence

$$ D(A) = \pm 1 $$ (VIII.16)
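
Property 9 and the result (VIII.16) can be illustrated with second-order determinants. In the Python sketch below the helper names and the sample matrices (an arbitrary pair `A`, `B` and a reflection chosen as an orthogonal example) are illustrative assumptions.

```python
def matmul(b, a):
    """Matrix product BA: element (i, k) is the sum over m of b[i][m] * a[m][k]."""
    return [[sum(b[i][m] * a[m][k] for m in range(len(a)))
             for k in range(len(a[0]))] for i in range(len(b))]

def transpose(a):
    """Interchange rows and columns."""
    return [list(col) for col in zip(*a)]

def det2(a):
    """Determinant of a 2 x 2 matrix."""
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]

A = [[1, 2],
     [3, 4]]
B = [[2, 0],
     [1, 3]]
BA = matmul(B, A)

# A reflection: its transpose times itself is the unit matrix.
reflection = [[1, 0],
              [0, -1]]
unit = matmul(transpose(reflection), reflection)
```

Here `det2(BA)` equals `det2(B) * det2(A)`, and the reflection, being orthogonal, has determinant −1.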

Systems of Linear Non-Homogeneous Algebraic Equations.
Consider a system of n linear algebraic equations with n unknowns
$x_1, x_2, \dots, x_n$:

$$ \begin{aligned} a_{11}x_1 + a_{12}x_2 + \dots + a_{1n}x_n &= b_1 \\ a_{21}x_1 + a_{22}x_2 + \dots + a_{2n}x_n &= b_2 \\ \dots \\ a_{n1}x_1 + a_{n2}x_2 + \dots + a_{nn}x_n &= b_n \end{aligned} $$

This system can be written as a single expression:

$$ \sum_{k=1}^{n} a_{ik}x_k = b_i \quad (i = 1, 2, \dots, n) $$ (VIII.17)

The coefficients of the unknowns can be seen to form a square
matrix similar to the matrix (VIII.1). Assume that the determinant
of this matrix (we shall call it the determinant of the system) is
non-zero:

$$ \begin{vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{vmatrix} \neq 0 $$ (VIII.18)


We multiply the first of Eqs. (VIII.17) by $A_{1m}$, the cofactor of the
element $a_{1m}$, the second equation by $A_{2m}$, ..., the n-th equation
by $A_{nm}$, and sum the expressions obtained. The result is

$$ \sum_{i=1}^{n} A_{im} \sum_{k=1}^{n} a_{ik}x_k = \sum_{i=1}^{n} A_{im}b_i $$

We change the sequence of summation on the left-hand side of the
equation:

$$ \sum_{k=1}^{n} x_k \sum_{i=1}^{n} A_{im}a_{ik} = \sum_{i=1}^{n} A_{im}b_i $$

By the second of formulas (VIII.12), $\sum_{i=1}^{n} A_{im}a_{ik} = \det\|a_{ik}\|\,\delta_{mk} = D\delta_{mk}$,
where D is the determinant of the system. Hence, the
relation obtained can be written as

$$ \sum_{k=1}^{n} x_k D\delta_{mk} = \sum_{i=1}^{n} A_{im}b_i $$

Summation on the left over k yields the product $x_mD$. A comparison
of the sum on the right with formula (VIII.11) allows us to conclude
that this sum is the determinant obtained from the determinant
(VIII.18) by substituting the free terms of the system (VIII.17)
for the elements of the m-th column. Denoting this determinant by
the symbol $D^{(m)}$, we can write that

$$ x_mD = D^{(m)} $$

whence

$$ x_k = \frac{D^{(k)}}{D} $$

[m has been replaced with k for the subscript on x to be designated
by the same letter in the given formula and in formula (VIII.17)].
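
The formula $x_k = D^{(k)}/D$ can be turned into a few lines of code. The Python sketch below uses exact fractions so that no rounding enters; the function names and the sample system are illustrative assumptions, and determinants are evaluated by expansion in the first row, which is adequate only for small n.

```python
from fractions import Fraction

def submatrix(a, i, k):
    """Delete row i and column k of a square matrix."""
    return [[a[r][c] for c in range(len(a)) if c != k]
            for r in range(len(a)) if r != i]

def det(a):
    """Determinant by expansion in the elements of the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum((-1) ** k * a[0][k] * det(submatrix(a, 0, k)) for k in range(len(a)))

def cramer(a, b):
    """Solve the system sum_k a[i][k] * x[k] = b[i] when its determinant is non-zero."""
    d = det(a)
    if d == 0:
        raise ValueError("the determinant of the system is zero")
    n = len(a)
    solution = []
    for k in range(n):
        # D^(k): the k-th column of the system's determinant replaced by the free terms.
        replaced = [[b[i] if c == k else a[i][c] for c in range(n)] for i in range(n)]
        solution.append(Fraction(det(replaced), d))
    return solution

a = [[2, 1, -1],
     [1, 3, 2],
     [1, 0, 1]]
b = [1, 13, 4]
x = cramer(a, b)
```

For this sample system the determinant is 10 and the solution is $x_1 = 1$, $x_2 = 2$, $x_3 = 3$, as substitution into the equations confirms.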

We have arrived at Cramer’s rule, which states: if the determinant
of a system is non-zero, the system has one definite solution, the value of the
unknown $x_k$ being equal to a fraction whose denominator is the
determinant D of the system, and whose numerator is the determinant
obtained from D by replacing the elements of the k-th column with the
free terms of the system:

$$ x_k = \frac{\begin{vmatrix} a_{11} & a_{12} & \dots & b_1 & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & b_2 & \dots & a_{2n} \\ \dots & \dots & \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & b_n & \dots & a_{nn} \end{vmatrix}}{\begin{vmatrix} a_{11} & a_{12} & \dots & a_{1k} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2k} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nk} & \dots & a_{nn} \end{vmatrix}} = \frac{\sum_{i=1}^{n} A_{ik}b_i}{D} = \frac{D^{(k)}}{D} $$ (VIII.19)

The solution of the system of equations (VIII.17) can be made
very clear by using the following representation. The unknowns
$x_1, x_2, \dots, x_n$ can be considered as the components of a vector x
in an n-dimensional space, and the free terms $b_1, b_2, \dots, b_n$ as the
components of a given vector b. Now the system (VIII.17) can be
written symbolically as the relation

$$ A\mathbf{x} = \mathbf{b} $$ (VIII.20)

where A is a matrix compiled from the coefficients of Eqs. (VIII.17):

$$ A = \begin{Vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{Vmatrix} $$

Indeed, expanding relation (VIII.20) by formula (VII.12), we obtain
n equations:

$$ \sum_{k=1}^{n} a_{ik}x_k = b_i \quad (i = 1, 2, \dots, n) $$

coinciding with the system (VIII.17).

Consequently, the problem of finding the unknowns $x_i$ can be
formulated as follows: we are given the matrix A and the vector b
in an n-dimensional space; it is necessary to find a vector x
that, when multiplied by the matrix A, transforms into the given
vector b.

We multiply Eq. (VIII.20) by the matrix $A^{-1}$. On the left, we
obtain the required vector x [see formula (VII.37)], and we arrive
at the relation

$$ \mathbf{x} = A^{-1}\mathbf{b} $$ (VIII.21)

Hence, to find the solutions of the system of equations (VIII.17),
we must proceed as follows: find the matrix that is the inverse of the
system’s matrix and substitute the elements $a'_{ki}$ of this matrix into the
formulas

$$ x_k = \sum_i a'_{ki}b_i $$ (VIII.22)

[see formulas (VII.11) and (VII.13)].

Comparing formulas (VIII.19) and (VIII.22), we arrive at the
conclusion that the elements $a'_{ki}$ of the inverse matrix $A^{-1}$ are determined
by the expressions

$$ a'_{ki} = \frac{A_{ik}}{D} $$ (VIII.23)
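
Formula (VIII.23) gives a direct, if not economical, recipe for the inverse matrix: element (k, i) of the inverse is the cofactor of element (i, k) divided by the determinant. The Python sketch below follows that recipe with exact fractions; the function names and the sample matrix are illustrative assumptions.

```python
from fractions import Fraction

def submatrix(a, i, k):
    """Delete row i and column k of a square matrix."""
    return [[a[r][c] for c in range(len(a)) if c != k]
            for r in range(len(a)) if r != i]

def det(a):
    """Determinant by expansion in the elements of the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum((-1) ** k * a[0][k] * det(submatrix(a, 0, k)) for k in range(len(a)))

def cofactor(a, i, k):
    """Algebraic cofactor of the element a[i][k]."""
    return (-1) ** (i + k) * det(submatrix(a, i, k))

def inverse(a):
    """Inverse matrix: row k, column i holds cofactor(a, i, k) / D (note the swapped subscripts)."""
    d = det(a)
    if d == 0:
        raise ValueError("a singular matrix has no inverse")
    n = len(a)
    return [[Fraction(cofactor(a, i, k), d) for i in range(n)] for k in range(n)]

def matmul(b, a):
    """Matrix product BA."""
    return [[sum(b[i][m] * a[m][k] for m in range(len(a)))
             for k in range(len(a[0]))] for i in range(len(b))]

a = [[1, 2],
     [3, 4]]
a_inv = inverse(a)
```

Multiplying `a_inv` by `a` in either order returns the unit matrix, in agreement with (VII.38).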


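We shall indicate another form of writing the relations we have
considered. Representing the vectors x and b as matrices with n
rows and only one column, we can write the system of equations
(VIII.17) as

$$ \begin{Vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{Vmatrix} \begin{Vmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{Vmatrix} = \begin{Vmatrix} b_1 \\ b_2 \\ \vdots \\ b_n \end{Vmatrix} $$ (VIII.24)

or, more briefly,

$$ A_{(n,n)} \cdot X_{(n,1)} = B_{(n,1)} $$ (VIII.25)

[compare with (VII.35)].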

Systems of Linear Homogeneous Equations. The system of
equations (VIII.17) in which all the free terms $b_i$ are zero is said to be
homogeneous. Hence, a homogeneous system has the form

$$ \begin{aligned} a_{11}x_1 + a_{12}x_2 + \dots + a_{1n}x_n &= 0 \\ a_{21}x_1 + a_{22}x_2 + \dots + a_{2n}x_n &= 0 \\ \dots \\ a_{n1}x_1 + a_{n2}x_2 + \dots + a_{nn}x_n &= 0 \end{aligned} $$ (VIII.26)

(we are considering only systems in which the number of equations
equals the number of unknowns).

If the determinant of this system is non-zero, according to Cramer’s
rule the system has one definite solution, which in the given case is
zero:

$$ x_k = \frac{\sum_i A_{ik}b_i}{D} = 0 \quad (k = 1, 2, \dots, n) $$

[see (VIII.19)].

Consequently, for a homogeneous system of equations to have a
non-zero solution, its determinant must be zero. It can be proved
that this condition is not only necessary, but also sufficient.

To be able to discuss the nature of the solutions of the system
(VIII.26), we must acquaint ourselves with the concept of the rank
of a matrix. If the number of rows m of a matrix differs from its
number of columns n, the determinant (VIII.4) cannot be compiled
for it. But by deleting certain rows and columns from the matrix,
we can form a determinant from the remaining rows and columns.
The determinants obtained in this way are said to be included in the
composition of the matrix. The highest possible order of these
determinants equals the minimum of the numbers m and n determining
the size of the matrix, while the smallest order of these determinants
is unity, the first-order determinants being elements of the matrix.

Assume that all the determinants of the order l in the composition
of a matrix are zero. Hence, all the determinants of the order (l + 1)
are also zero (this follows from property 4 of determinants). In
other words, if all the determinants of the order l in the composition
of a matrix are zero, all the higher-order determinants are also zero.

The highest order of a non-zero determinant in a matrix is called
its rank. Hence, the fact that the rank of a matrix is r signifies that
among the r-th order determinants in the matrix at least one is
non-zero; all the determinants of a higher order are zero here.
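
The definition of rank just given can be followed literally in code: try the highest possible order first and step down until a non-zero determinant is found among the selected rows and columns. The Python sketch below is written for clarity rather than speed; the function names and the sample matrices are illustrative assumptions.

```python
from itertools import combinations

def submatrix(a, i, k):
    """Delete row i and column k of a square matrix."""
    return [[a[r][c] for c in range(len(a)) if c != k]
            for r in range(len(a)) if r != i]

def det(a):
    """Determinant by expansion in the elements of the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum((-1) ** k * a[0][k] * det(submatrix(a, 0, k)) for k in range(len(a)))

def rank(a):
    """Highest order of a non-zero determinant formed from rows and columns of a."""
    m, n = len(a), len(a[0])
    for order in range(min(m, n), 0, -1):
        for rows in combinations(range(m), order):
            for cols in combinations(range(n), order):
                selected = [[a[i][k] for k in cols] for i in rows]
                if det(selected) != 0:
                    return order
    return 0  # every element is zero

square = [[1, 2, 3],
          [2, 4, 6],
          [1, 0, 1]]   # second row is twice the first
rectangular = [[1, 2, 3],
               [2, 4, 6]]
```

Here `rank(square)` is 2 and `rank(rectangular)` is 1; a unit matrix of order n has rank n.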

The concept of the rank, naturally, may also be applied to a square
matrix. For example, when condition (VIII.18) is observed, the
rank of the matrix formed from the coefficients of the
non-homogeneous system of equations (VIII.17) is n.

For the homogeneous system (VIII.26) to have a non-zero solution,
its determinant must be zero. In other words, the rank of the matrix
formed from the coefficients of the system must be less than n. Assume
that the rank of the system’s matrix is r (where $1 \le r < n$). Here
there are n − r linearly independent solutions:

$$ x_1^{(\alpha)}, x_2^{(\alpha)}, \dots, x_n^{(\alpha)} \quad (\alpha = 1, 2, \dots, n - r) $$

The values of the unknowns determined by the expressions

$$ x_i = \sum_{\alpha} c_\alpha x_i^{(\alpha)} \quad (i = 1, 2, \dots, n) $$ (VIII.27)

where $c_\alpha$ are arbitrary constants, will be the most general solution.

In the particular case when the rank of a system’s matrix is
r = n − 1, there is only one linearly independent solution. The
following values of the unknowns can be proved to be this solution:

$$ x_1 = cA_{k1}, \quad x_2 = cA_{k2}, \quad \dots, \quad x_n = cA_{kn} $$ (VIII.28)

where $A_{ki}$ is the algebraic cofactor of the element $a_{ki}$ in the
determinant D of the system, c is an arbitrary constant, and k is selected
so that at least one of the $A_{ki}$’s (where i = 1, 2, ..., n) is non-zero.
The values of $A_{ki}$ obtained with a different selection of k
satisfying the condition indicated above differ from one another by
a common factor that can be included in the constant c. Hence, the
form of the solution (VIII.28) does not depend on how k is selected.
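
The solution (VIII.28) is simple to reproduce numerically: pick a row whose cofactors are not all zero and use those cofactors as the unknowns. The Python sketch below takes c = 1; the function names and the sample matrix (of rank 2, with zero determinant) are illustrative assumptions.

```python
def submatrix(a, i, k):
    """Delete row i and column k of a square matrix."""
    return [[a[r][c] for c in range(len(a)) if c != k]
            for r in range(len(a)) if r != i]

def det(a):
    """Determinant by expansion in the elements of the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum((-1) ** k * a[0][k] * det(submatrix(a, 0, k)) for k in range(len(a)))

def cofactor(a, i, k):
    """Algebraic cofactor of the element a[i][k]."""
    return (-1) ** (i + k) * det(submatrix(a, i, k))

def nonzero_solution(a):
    """Return x_i = A_ki (c = 1) for the first row k having a non-zero cofactor."""
    n = len(a)
    if det(a) != 0:
        raise ValueError("non-zero determinant: only the zero solution exists")
    for k in range(n):
        row = [cofactor(a, k, i) for i in range(n)]
        if any(row):
            return row
    raise ValueError("rank is below n - 1; this formula does not apply")

a = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 9]]
x = nonzero_solution(a)
```

For this matrix the cofactors of the first row give x = (−3, 6, −3), which is proportional to (1, −2, 1) and satisfies all three homogeneous equations.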
Assume that a set of values $x_i = \xi_i$ satisfies the system (VIII.26).
It is not difficult to see that the values $x_i = \lambda\xi_i$ (here $\lambda$ is an
arbitrary constant) also satisfy the system. This explains the presence of
the factor c in formulas (VIII.28). It follows from the above that the
system (VIII.26) uniquely determines only the ratios $x_i/x_k$, while
the values of $x_i$ themselves are determined to within an arbitrary
factor.

The problem of solving a system of homogeneous equations can be
given the following geometrical interpretation. We shall consider
the set of quantities ($x_1, x_2, \dots, x_n$) as an n-vector x; similarly we
shall consider the set of quantities ($a_{i1}, a_{i2}, \dots, a_{in}$) as an n-vector
$\mathbf{a}_i$ (there will be n such vectors). The system (VIII.26) can therefore
be written as

$$ \mathbf{a}_i\mathbf{x} = 0 \quad (i = 1, 2, \dots, n) $$

[see formula (VI.41)], and the problem itself formulated as follows:
n vectors are given in an n-dimensional space. It is necessary to find
a vector x that is perpendicular to all the vectors $\mathbf{a}_i$.

It is evident that the multiplication of the vector x by the scalar c
does not violate its orthogonality to the vectors $\mathbf{a}_i$. Therefore, the
unknowns $x_1, x_2, \dots, x_n$ are determined by the system (VIII.26)
to within the arbitrary factor c so that the value of one of the
unknowns can be chosen arbitrarily (for instance, we can assume that
$x_1 = 1$); now the values of the remaining unknowns will be
determined uniquely (they will be expressed in terms of $x_1$).

IX. Quadratic Forms 

A quadratic form $f$ of the variables $x_1, x_2, \dots, x_n$ is defined to
be a homogeneous polynomial of the second degree in these variables.
Such a polynomial can be written as

$$ f = \sum_{i,k=1}^{n} a_{ik}x_ix_k $$ (IX.1)

where

$$ a_{ik} = a_{ki} $$ (IX.2)

are constant quantities that may be either real or complex. If all
the coefficients $a_{ik}$ are real, the quadratic form is said to be real.
If at least one of the coefficients is complex, the quadratic form is
said to be complex.

A real quadratic form is called positive definite (negative definite)
if this form has positive (negative) values for any real values of the
variables $x_1, x_2, \dots, x_n$ not equal to zero simultaneously.
In the following, we shall consider only real quadratic forms.
The symmetric matrix

$$ A = \begin{Vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{Vmatrix} $$ (IX.3)

compiled from the coefficients of the polynomial (IX.1) is called
the matrix of the quadratic form. It is obvious that a quadratic form is
completely determined by its matrix.

The determinant

$$ D(A) = \det\|a_{ik}\| $$ (IX.4)

compiled from the coefficients of a quadratic form is called its
discriminant.

A quadratic form can be written as the product of three matrices:

$$ f = \|x_1 \ x_2 \ \dots \ x_n\| \begin{Vmatrix} a_{11} & a_{12} & \dots & a_{1n} \\ a_{21} & a_{22} & \dots & a_{2n} \\ \dots & \dots & \dots & \dots \\ a_{n1} & a_{n2} & \dots & a_{nn} \end{Vmatrix} \begin{Vmatrix} x_1 \\ x_2 \\ \vdots \\ x_n \end{Vmatrix} = \tilde{X}AX $$ (IX.5)

where X is a column matrix, and $\tilde{X}$ is its transposed matrix. Indeed,
the product of a column matrix and a square matrix is a column
matrix [see (VII.32)] whose elements in the case being considered are

$$ \sum_k a_{ik}x_k \quad (i = 1, 2, \dots, n) $$

The product of a column matrix and a row matrix is simply a function
[see (VII.33)]. In the given case, this function is

$$ \sum_i x_i \sum_k a_{ik}x_k $$

The last expression can be seen to be identical to expression (IX.1).
A quadratic form such as

$$ f_{can} = \sum_{k=1}^{n} \lambda_kx_k^2 $$ (IX.6)

containing no terms with products of different variables is known
as a canonical one. Such a form has a diagonal matrix and is
therefore also called a diagonal quadratic form. It is obvious that if all
the $\lambda_k$’s are greater (smaller) than zero, the form (IX.6) will be
positive definite (negative definite).

Any quadratic form can be reduced to a diagonal form with the
aid of a non-singular linear transformation¹. Let us go over from the
variables $x_1, x_2, \dots, x_n$ to the new variables $y_1, y_2, \dots, y_n$ associated
with the previous variables by means of the linear relations

$$ x_i = \sum_k b_{ik}y_k $$ (IX.7)

1 A linear transformation B (where B is a matrix) is called non-singular
when its determinant is non-zero: $D(B) \neq 0$.

Let us consider the collection of quantities $x_i$ and the collection of
quantities $y_k$ as column matrices. This allows us to write formula
(IX.7) as

$$ X = BY $$ (IX.8)

where B is the matrix of the linear transformation whose elements
are the coefficients $b_{ik}$.

Introducing the values of $x_i$ determined by relations (IX.7) into
formula (IX.1), we find an expression of the quadratic form in the
new variables:

$$ f = \sum_{i,k} a_{ik} \sum_l b_{il}y_l \sum_m b_{km}y_m = \sum_{l,m} y_ly_m \sum_{i,k} a_{ik}b_{il}b_{km} = \sum_{l,m} c_{lm}y_ly_m $$ (IX.9)

where

$$ c_{lm} = \sum_{i,k} a_{ik}b_{il}b_{km} $$ (IX.10)

It is not difficult to see that the condition $a_{ik} = a_{ki}$ results in
$c_{lm} = c_{ml}$.

Let us write expression (IX.10) as follows:

$$ c_{lm} = \sum_i b_{il} \sum_k a_{ik}b_{km} = \sum_i \tilde{b}_{li} \sum_k a_{ik}b_{km} $$

[we have replaced the elements of the matrix B with the corresponding
elements of the transposed matrix $\tilde{B}$, see formula (VII.7)].
According to (VII.28), $\sum_k a_{ik}b_{km}$ is $(AB)_{im}$, an element of the
matrix obtained by multiplying the matrices B and A. Similarly,
$\sum_i \tilde{b}_{li}(AB)_{im}$ is $(\tilde{B}AB)_{lm}$, an element of the matrix obtained by
multiplying the matrices (AB) and $\tilde{B}$. Consequently, the matrix C
whose elements are determined by formula (IX.10) can be written as

$$ C = \tilde{B}AB $$ (IX.11)

By (VIII. 15), the determinant D ( C ) of the matrix C is 

D (C) =D (B) D (A) D ( B ) 

Since the matrices B and B differ in the columns being interchanged 
with the rows, while the determinant does not change in this case 
[see (VIII. 8)], we have D ( B ) — D ( B ), and we can write 

D (C) — D (/l) ID (B) l a 


(IX. 12) 
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Hence, in a linear transformation of variables, the discriminant of 
the quadratic form is multiplied by the square of the determinant of 
the transformation from the new variables to the initial ones. 
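
Relation (IX.12) is easy to check numerically. The following is only an illustrative sketch and is not part of the original derivation: the matrices `A` and `B` are arbitrary values chosen here, and the helper names are hypothetical.

```python
# Check D(C) = D(A) [D(B)]^2 for C = B~ A B, using 2x2 integer matrices.

def transpose(m):
    return [list(row) for row in zip(*m)]

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(len(q)))
             for j in range(len(q[0]))] for i in range(len(p))]

def det2(m):
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

A = [[2, 1], [1, 3]]   # symmetric matrix of the quadratic form (assumed values)
B = [[1, 2], [0, 3]]   # non-singular transformation, D(B) = 3 (assumed values)

C = matmul(transpose(B), matmul(A, B))   # formula (IX.11)

print(C)                                  # [[2, 7], [7, 47]], symmetric as expected
print(det2(C), det2(A) * det2(B) ** 2)    # 45 45
```

The matrix `C` also comes out symmetric, in agreement with the remark that $a_{ik} = a_{ki}$ implies $c_{lm} = c_{ml}$.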

If the transformation $B$ is orthogonal [this signifies that the
coefficients $b_{ik}$ satisfy the conditions (VII.40) and (VII.41)], the transposed
matrix coincides with the inverse one: $\tilde{B} = B^{-1}$ [see (VII.8)].
Consequently, for an orthogonal transformation, formula (IX.11) is as
follows:

$$C = B^{-1}AB \qquad \text{(IX.13)}$$

The matrix $C$ determined by relation (IX.13) is a matrix of a
quadratic form in the new variables $y_k$. Let us find an orthogonal
transformation $B$ such that the matrix $C$ will be diagonal, i.e. will
have the form

$$C = \Lambda = \begin{pmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & \lambda_n \end{pmatrix} \qquad \text{(IX.14)}$$

Now the quadratic form in the new variables $y_k$ will be canonical
[see formula (IX.6)].

We multiply both sides of formula (IX.13) from the left by $B$. Since
$BB^{-1} = I$, and multiplication by a unit matrix does not change the
second factor [see (VII.36)], we arrive at the relation

$$BC = AB$$

or

$$\sum_m b_{im} c_{mk} = \sum_m a_{im} b_{mk} \qquad (i, k = 1, 2, \ldots, n) \qquad \text{(IX.15)}$$

In accordance with (IX.14), the elements $c_{mk}$ can be written as
$c_{mk} = \lambda_m \delta_{mk}$. Substitution of this value of $c_{mk}$ into formula (IX.15)
yields

$$\sum_m b_{im} \lambda_m \delta_{mk} = \sum_m a_{im} b_{mk} \qquad (i, k = 1, 2, \ldots, n)$$

In the sum on the left, only the addend with $m = k$ will be non-zero,
and it equals $b_{ik}\lambda_k$. We thus arrive at the equation

$$b_{ik}\lambda_k = \sum_m a_{im} b_{mk} \qquad (i, k = 1, 2, \ldots, n) \qquad \text{(IX.16)}$$


Expression (IX.16) can be treated as a set of $n^2$ equations with $n^2$
unknowns $b_{ik}$. These equations can be divided into $n$ groups (differing
in the values of the subscript $k$). Each group consists of $n$ equations
differing in the values of the subscript $i$. Transferring all the terms
in (IX.16) to one side, we can write the $k$-th group of equations
in the form

$$\begin{aligned}
(a_{11} - \lambda_k) b_{1k} + a_{12} b_{2k} + \cdots + a_{1n} b_{nk} &= 0 \\
a_{21} b_{1k} + (a_{22} - \lambda_k) b_{2k} + \cdots + a_{2n} b_{nk} &= 0 \\
\cdots \qquad & \\
a_{n1} b_{1k} + a_{n2} b_{2k} + \cdots + (a_{nn} - \lambda_k) b_{nk} &= 0
\end{aligned} \qquad \text{(IX.17)}$$

For the system of equations (IX.17) to have non-zero solutions,
its determinant must be zero (see Appendix VIII):

$$\begin{vmatrix}
a_{11} - \lambda & a_{12} & \cdots & a_{1n} \\
a_{21} & a_{22} - \lambda & \cdots & a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
a_{n1} & a_{n2} & \cdots & a_{nn} - \lambda
\end{vmatrix} = 0 \qquad \text{(IX.18)}$$

(we have dropped the subscript on $\lambda$ because similar conditions are
obtained with any $k$). This expression is an algebraic equation of
the $n$-th degree in the unknown $\lambda$. It is called the characteristic
equation of the matrix $A$.
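
For $n = 2$ the whole procedure can be carried out explicitly. The sketch below is an illustration added here, not the author's method: it solves the characteristic equation for a symmetric 2×2 matrix, takes each column of $B$ from the first equation of the corresponding group of equations, normalizes it, and checks that $c_{lm} = \sum_{i,k} a_{ik} b_{il} b_{km}$ is diagonal. The function names and the sample matrix are assumptions.

```python
import math

def diagonalize_2x2(a11, a12, a22):
    """Eigenvalues and orthogonal matrix B for A = [[a11, a12], [a12, a22]]."""
    # Characteristic equation for n = 2:
    #   lam^2 - (a11 + a22) lam + (a11 a22 - a12^2) = 0
    half_trace = 0.5 * (a11 + a22)
    root = math.hypot(0.5 * (a11 - a22), a12)   # real for any symmetric A
    lams = (half_trace - root, half_trace + root)
    if a12 == 0.0:
        # A is already diagonal; at most the order of the axes changes.
        B = [[1.0, 0.0], [0.0, 1.0]] if a11 <= a22 else [[0.0, 1.0], [1.0, 0.0]]
        return lams, B
    columns = []
    for lam in lams:
        # First equation of the k-th group: (a11 - lam) b1 + a12 b2 = 0
        b1, b2 = a12, lam - a11
        norm = math.hypot(b1, b2)
        columns.append((b1 / norm, b2 / norm))
    B = [[columns[0][0], columns[1][0]],
         [columns[0][1], columns[1][1]]]
    return lams, B

def transformed_matrix(A, B):
    """c_lm = sum over i, k of a_ik b_il b_km."""
    n = len(A)
    return [[sum(A[i][k] * B[i][l] * B[k][m] for i in range(n) for k in range(n))
             for m in range(n)] for l in range(n)]

A = [[2.0, 1.0], [1.0, 2.0]]          # sample symmetric matrix (assumed values)
lams, B = diagonalize_2x2(2.0, 1.0, 2.0)
C = transformed_matrix(A, B)
print(lams)   # the two roots of the characteristic equation (1 and 3 here)
print(C)      # diagonal up to rounding, with the roots on the diagonal
```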

Equation (IX.18) has $n$ roots: $\lambda_1, \lambda_2, \ldots, \lambda_n$, which are the elements
of the required diagonal matrix (IX.14). Introducing the values of
$\lambda_k$ in turn into the system (IX.17) and solving this system relative
to the unknowns $b_{ik}$, we find the elements of the matrix of the
transition from the variables $y_k$ in which the quadratic form is diagonal
to the previous variables $x_i$ [see (IX.7)]. The transition from the
variables $x_i$ to the variables $y_k$ can be performed with the aid of
the inverse matrix $B^{-1}$. Since in accordance with our condition
$D(B) \neq 0$ (the transformation $B$ is non-singular), an inverse matrix
exists; its elements can be evaluated by formula (VIII.23).

The roots of Eq. (IX.18) (i.e. the quantities $\lambda_k$) are real. This
follows directly from relations (IX.16), if we take into account the
real nature of the quantities $a_{ik}$ and $b_{ik}$. The matrix (IX.14) will
thus be real.

Consider the following diagonal quadratic form:

$$f = \sum_i x_i^2 \qquad \text{(IX.19)}$$

It is positive definite. Its matrix is the unit matrix

$$A = I = \|\delta_{ik}\| \qquad \text{(IX.20)}$$

Let us apply an arbitrary orthogonal transformation $B$ to the
variables $x_i$ and see what the quadratic form (IX.19) will be in the
new variables. By (IX.13), the matrix $C$ of the quadratic form in
the new variables is determined by the expression

$$C = B^{-1}IB = B^{-1}B = I$$
[see formulas (VII.36) and (VII.38)]. Consequently, the quadratic
form (IX.19) in the new variables is

$$f = \sum_i y_i^2$$

Any orthogonal transformation of variables thus leaves a quadratic
form of the kind given by (IX.19) unchanged.

Let us consider the real quadratic form

$$f = \sum_{k,m} a_{km} z_k^* z_m \qquad \text{(IX.21)}$$

where $z_1, z_2, \ldots, z_n$ are complex quantities¹. We shall represent
$z_m$ as $x_m + iy_m$, and $z_k^*$ correspondingly as $x_k - iy_k$. Hence,

$$f = \sum_{k,m} a_{km}(x_k - iy_k)(x_m + iy_m) = \sum_{k,m} a_{km}(x_k x_m + y_k y_m) + i\sum_{k,m} a_{km}(x_k y_m - x_m y_k) = f_1 + if_2 \qquad \text{(IX.22)}$$

Let us interchange the dummy indices $k$ and $m$ in the sum
determining the imaginary part $f_2$ in (IX.22):

$$f_2 = \sum_{k,m} a_{km}(x_k y_m - x_m y_k) = \sum_{m,k} a_{mk}(x_m y_k - x_k y_m) = -\sum_{k,m} a_{km}(x_k y_m - x_m y_k) = -f_2$$

[we have taken advantage of the fact that $a_{mk} = a_{km}$; see (IX.2)].
The relation $f_2 = -f_2$ is possible only when $f_2 = 0$. We have thus
proved that the quadratic form (IX.21) has real values at any complex
$z$'s. By (IX.22), let us write it as

$$f = \sum_{k,m} a_{km} x_k x_m + \sum_{k,m} a_{km} y_k y_m = f_0(x_k) + f_0(y_k) \qquad \text{(IX.23)}$$

where

$$f_0(\xi_k) = \sum_{k,m} a_{km} \xi_k \xi_m$$

(recall that a quadratic form is determined completely by its matrix
and does not depend on the designation of the variables).
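
The splitting of the form into $f_0(x_k) + f_0(y_k)$ with a vanishing imaginary part can be illustrated with Python's built-in complex numbers. This is a sketch under assumed sample values; the coefficient matrix `a`, the vector `z` and the function names are all hypothetical.

```python
def hermitian_form(a, z):
    """f = sum over k, m of a_km * conj(z_k) * z_m."""
    n = len(z)
    return sum(a[k][m] * z[k].conjugate() * z[m]
               for k in range(n) for m in range(n))

def real_form(a, xi):
    """f0(xi) = sum over k, m of a_km * xi_k * xi_m."""
    n = len(xi)
    return sum(a[k][m] * xi[k] * xi[m] for k in range(n) for m in range(n))

a = [[2.0, -1.0], [-1.0, 3.0]]     # real symmetric coefficients (assumed values)
z = [1 + 2j, -0.5 + 1j]            # arbitrary complex variables (assumed values)

f = hermitian_form(a, z)
x = [c.real for c in z]
y = [c.imag for c in z]

print(f)                                   # imaginary part is zero
print(real_form(a, x) + real_form(a, y))   # equals the real part of f
```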

Simultaneous Reduction of Two Quadratic Forms to a Diagonal
Form. Assume that we have two quadratic forms:

$$f_1 = \sum_{i,k} a_{ik} x_i x_k \qquad \text{(IX.24)}$$

$$f_2 = \sum_{i,k} b_{ik} x_i x_k \qquad \text{(IX.25)}$$

1 Recall that a quadratic form with the real coefficients $a_{km}$ is said to be
real. The form (IX.21) is a particular case of the Hermitian form in which the
coefficients, generally speaking, are complex and satisfy the condition
$a_{ik} = a_{ki}^*$.
We shall show that if one of them, say $f_1$, is positive definite, it is
possible to find a linear transformation of the variables that reduces
both forms to a diagonal one. We shall carry out the required
transformation in several steps. First, using the orthogonal transformation
$F$, we shall pass over to the variables $v_i$ in which the form (IX.24)
acquires a diagonal form:

$$f_1 = \sum_i \mu_i v_i^2$$

(such a transformation was considered earlier in detail).

Now let us pass over from the variables $v_i$ to the variables

$$u_i = \sqrt{\mu_i}\, v_i$$

(it can readily be seen that with at least one $\mu_i \neq 1$, this
transformation is not orthogonal). Since the form $f_1$ is positive definite, all
the coefficients $\mu_i$ are positive so that the variables $u_i$ will be real.
In these variables

$$f_1 = \sum_i u_i^2$$

hence, $I$ will be the matrix of $f_1$.
Finally, let us go over with the aid of the orthogonal transformation
$G$ from the variables $u_i$ to the variables $y_i$ such that the form
$f_2$ will become diagonal. The form $f_1$ here remains diagonal because,
as was shown above, a quadratic form with the matrix $I$ [see (IX.20)]
does not change in any orthogonal transformation. Consequently, in
the variables $y_i$, the quadratic forms (IX.24) and (IX.25) will be
diagonal:

$$f_1 = \sum_i y_i^2, \qquad f_2 = \sum_i \lambda_i y_i^2 \qquad \text{(IX.26)}$$

The entire sequence of transformations can be represented by the
diagram:

$$x_i \xrightarrow{\ \text{transformation } F\ } v_i \xrightarrow{\ \times\sqrt{\mu_i}\ } u_i \xrightarrow{\ \text{transformation } G\ } y_i \qquad \text{(IX.27)}$$

To establish a method of finding the coefficients $\lambda_i$, let us compile
the auxiliary quadratic form

$$f = f_2 - \lambda f_1 = \sum_{i,k} (b_{ik} - \lambda a_{ik}) x_i x_k = \sum_i (\lambda_i - \lambda) y_i^2 \qquad \text{(IX.28)}$$
whose coefficients contain the parameter $\lambda$. The discriminant of this
form in the variables $x_i$ is

$$\det \|b_{ik} - \lambda a_{ik}\| \qquad \text{(IX.29)}$$

and in the variables $y_i$

$$\det \|(\lambda_i - \lambda)\delta_{ik}\| = (\lambda_1 - \lambda)(\lambda_2 - \lambda)\cdots(\lambda_n - \lambda) \qquad \text{(IX.30)}$$

Denoting the transformation of the direct transition from the
variables $x_i$ to the variables $y_i$ by the letter $B$ (this transformation,
generally speaking, will not be orthogonal), let us write the following
expression for the matrix of the quadratic form (IX.28) in the
variables $y_i$:

$$C = \tilde{B}AB$$

where $A$ stands for the matrix of the form (IX.28) in the variables $x_i$
[see formula (IX.11)]. According to (IX.12)

$$D(C) = D(A)\,[D(B)]^2$$

where $D(C)$ is the determinant (IX.30), $D(A)$ is the determinant
(IX.29), and $D(B)$ is the non-zero determinant of the matrix of the
transformation $B$ in which the parameter $\lambda$ is absent [$D(B) \neq 0$
because the transformation $B$ is non-singular].

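Hence,

$$(\lambda_1 - \lambda)(\lambda_2 - \lambda)\cdots(\lambda_n - \lambda) = \det \|b_{ik} - \lambda a_{ik}\|\,[D(B)]^2$$

The substitution of any of the values of $\lambda_k$ for $\lambda$ makes the left-hand
side of the equation and, consequently, the factor $\det \|b_{ik} - \lambda a_{ik}\|$
vanish. Therefore, the quantities $\lambda_k$ are the roots of the equation

$$\begin{vmatrix}
b_{11} - \lambda a_{11} & b_{12} - \lambda a_{12} & \cdots & b_{1n} - \lambda a_{1n} \\
b_{21} - \lambda a_{21} & b_{22} - \lambda a_{22} & \cdots & b_{2n} - \lambda a_{2n} \\
\vdots & \vdots & \ddots & \vdots \\
b_{n1} - \lambda a_{n1} & b_{n2} - \lambda a_{n2} & \cdots & b_{nn} - \lambda a_{nn}
\end{vmatrix} = 0 \qquad \text{(IX.31)}$$

The task of reducing the quadratic forms (IX.24) and (IX.25) to a
diagonal form thus consists in finding the roots of Eq. (IX.31).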
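
For two forms in two variables, Eq. (IX.31) is a quadratic equation in $\lambda$ and can be solved directly. The sketch below is an added illustration with assumed sample matrices; the function names are hypothetical, and the first matrix is assumed to be positive definite so that the leading coefficient is positive.

```python
import math

def det_b_minus_lambda_a(a, b, lam):
    """det || b_ik - lam * a_ik || for symmetric 2x2 matrices a and b."""
    return ((b[0][0] - lam * a[0][0]) * (b[1][1] - lam * a[1][1])
            - (b[0][1] - lam * a[0][1]) ** 2)

def generalized_roots_2x2(a, b):
    """Roots of det || b_ik - lam * a_ik || = 0 (a positive definite)."""
    # Expanding the determinant gives  qa*lam^2 - qb*lam + qc = 0.
    qa = a[0][0] * a[1][1] - a[0][1] ** 2
    qb = a[0][0] * b[1][1] + a[1][1] * b[0][0] - 2.0 * a[0][1] * b[0][1]
    qc = b[0][0] * b[1][1] - b[0][1] ** 2
    disc = max(qb * qb - 4.0 * qa * qc, 0.0)   # guard against rounding below zero
    s = math.sqrt(disc)
    return ((qb - s) / (2.0 * qa), (qb + s) / (2.0 * qa))

a = [[2.0, 0.0], [0.0, 1.0]]   # matrix of the positive definite form (assumed values)
b = [[2.0, 1.0], [1.0, 2.0]]   # matrix of the second form (assumed values)

roots = generalized_roots_2x2(a, b)
print(roots)
print([det_b_minus_lambda_a(a, b, lam) for lam in roots])   # both close to zero
```

The two roots are the coefficients of the second form after the simultaneous reduction, while the first form becomes a plain sum of squares.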
Let us consider in addition to the quadratic form in the variables $x_i$

$$f = \sum_{i,k} a_{ik} x_i x_k \qquad \text{(IX.32)}$$

the similar quadratic form in the variables $\dot{x}_i$

$$f' = \sum_{i,k} a_{ik} \dot{x}_i \dot{x}_k \qquad \text{(IX.33)}$$

where $\dot{x}_i$ is the derivative of the variable $x_i$ with respect to a
parameter $t$. Let us call $\dot{x}_i$ the $i$-th velocity.

In the linear transformation

$$x_i = \sum_m c_{im} y_m \qquad \text{(IX.34)}$$

from the variables $x_i$ to the new variables $y_i$, the velocities experience
the same transformation:

$$\dot{x}_i = \sum_m c_{im} \dot{y}_m \qquad \text{(IX.35)}$$


We express the forms (IX.32) and (IX.33) in the new variables.
To do this, we introduce expressions (IX.34) and (IX.35) into
formulas (IX.32) and (IX.33):

$$f = \sum_{i,k} a_{ik} x_i x_k = \sum_{i,k} a_{ik} \sum_m c_{im} y_m \sum_l c_{kl} y_l = \sum_{m,l} y_m y_l \sum_{i,k} a_{ik} c_{im} c_{kl} = \sum_{m,l} b_{ml} y_m y_l$$

where

$$b_{ml} = \sum_{i,k} a_{ik} c_{im} c_{kl} \qquad \text{(IX.36)}$$

Similarly

$$f' = \sum_{i,k} a_{ik} \dot{x}_i \dot{x}_k = \sum_{i,k} a_{ik} \sum_m c_{im} \dot{y}_m \sum_l c_{kl} \dot{y}_l = \sum_{m,l} \dot{y}_m \dot{y}_l \sum_{i,k} a_{ik} c_{im} c_{kl} = \sum_{m,l} b_{ml} \dot{y}_m \dot{y}_l$$

where $b_{ml}$ has the same values (IX.36) as in the preceding case.

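Hence, in any linear transformation from the variables $x_i$ to the
new variables $y_i$, the coefficients of the quadratic form of the
velocities $\dot{x}_i$ are transformed in exactly the same way as the coefficients of
the similar quadratic form of $x_i$. On these grounds, the diagram
(IX.27) can be modified as follows:

$$\begin{array}{ccccccc}
x_i & \xrightarrow{\ \text{transformation } F\ } & v_i & \xrightarrow{\ \times\sqrt{\mu_i}\ } & u_i & \xrightarrow{\ \text{transformation } G\ } & y_i \\[4pt]
\sum_{i,k} a_{ik} \dot{x}_i \dot{x}_k & \to & \sum_i \mu_i \dot{v}_i^2 & \to & \sum_i \dot{u}_i^2 & \to & \sum_i \dot{y}_i^2 \\[4pt]
\sum_{i,k} b_{ik} x_i x_k & \to & \sum_{i,k} b'_{ik} v_i v_k & \to & \sum_{i,k} b''_{ik} u_i u_k & \to & \sum_i \lambda_i y_i^2
\end{array} \qquad \text{(IX.37)}$$

The quadratic form $\sum_{i,k} a_{ik} \dot{x}_i \dot{x}_k$ is assumed to be positive
definite.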


X. Tensors 

1. Definition of a Tensor. To arrive at the concept of a tensor, 
let us consider the polarization of an anisotropic dielectric. 


In an isotropic dielectric, the polarization P is proportional to the
electric field strength E:

$$\mathbf{P} = \chi\mathbf{E} \qquad \text{(X.1)}$$

where $\chi$ is the dielectric susceptibility. According to (X.1), the
vectors P and E are collinear.

In an anisotropic dielectric, the polarizability differs in different
directions. As a result, the direction of the vector P, generally
speaking, does not coincide with that of the vector E. Experiments
show that in any anisotropic dielectric there are three mutually
perpendicular directions such that when the direction of E coincides
with one of them, the vector P is collinear with E. These directions
are called the principal ones. Let us direct the coordinate axes along
the principal directions of a dielectric (Fig. X.1). The arbitrarily
directed vector E can be resolved into the components $\mathbf{E}_x$, $\mathbf{E}_y$, and $\mathbf{E}_z$
(the last component is perpendicular to the plane of the drawing).
The component $\mathbf{E}_x$ will set up the polarization $\mathbf{P}_x = \chi_x \mathbf{E}_x$ collinear
with it, where $\chi_x$ is the susceptibility in the direction of the x-axis.
Similarly, the other two components will set up $\mathbf{P}_y = \chi_y \mathbf{E}_y$ and
$\mathbf{P}_z = \chi_z \mathbf{E}_z$. It is not difficult to note that with different values of
$\chi_x$, $\chi_y$, and $\chi_z$ the resultant vector $\mathbf{P} = \mathbf{P}_x + \mathbf{P}_y + \mathbf{P}_z$ will not be
collinear with E.
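
A short numerical illustration of the last statement, added here as a sketch: in the principal axes each component of P is the corresponding component of E times its own susceptibility, and collinearity is tested with the vector product. The susceptibility values and function names are assumptions made only for this example.

```python
def polarization(chi, E):
    """Principal-axes case: P_x = chi_x E_x, P_y = chi_y E_y, P_z = chi_z E_z."""
    return tuple(c * e for c, e in zip(chi, E))

def cross(u, v):
    """Vector product; it vanishes when u and v are collinear."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

E = (1.0, 1.0, 0.0)                              # field not along a principal axis
P_aniso = polarization((1.0, 2.0, 3.0), E)       # different susceptibilities (assumed)
P_iso = polarization((2.0, 2.0, 2.0), E)         # equal susceptibilities (assumed)

print(cross(P_aniso, E))   # has a non-zero component: P is not collinear with E
print(cross(P_iso, E))     # all components vanish: P is collinear with E
```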

Let us take an anisotropic dielectric which we shall consider to
be a homogeneous unbounded medium. We associate with it a Cartesian
coordinate system whose axes are oriented absolutely arbitrarily
and coincide with none of the principal directions of the
dielectric. With the field $E_x$ directed along the x-axis, not only $P_x$,
but also $P_y$ and $P_z$ will be non-zero, and

$$P_x = \chi_{xx}E_x, \qquad P_y = \chi_{yx}E_x, \qquad P_z = \chi_{zx}E_x \qquad \text{(X.2)}$$

where $\chi_{xx}$, $\chi_{yx}$, and $\chi_{zx}$ are coefficients of proportionality between
$E_x$ and the relevant components of P.

Similarly, the fields $E_y$ and $E_z$ will cause the polarizations

$$\begin{aligned}
P_x &= \chi_{xy}E_y, & P_y &= \chi_{yy}E_y, & P_z &= \chi_{zy}E_y \\
P_x &= \chi_{xz}E_z, & P_y &= \chi_{yz}E_z, & P_z &= \chi_{zz}E_z
\end{aligned} \qquad \text{(X.3)}$$

With a field E not coinciding with any of the coordinate axes,
$E_x$, $E_y$, and $E_z$ will exist simultaneously so that all the $P_i$'s
determined by formulas (X.2) and (X.3) will appear. Combining the
relevant components of the vector P, we find that

$$\begin{aligned}
P_x &= \chi_{xx}E_x + \chi_{xy}E_y + \chi_{xz}E_z \\
P_y &= \chi_{yx}E_x + \chi_{yy}E_y + \chi_{yz}E_z \\
P_z &= \chi_{zx}E_x + \chi_{zy}E_y + \chi_{zz}E_z
\end{aligned} \qquad \text{(X.4)}$$

Using numerical subscripts instead of letters, we can write equations
(X.4) in a compact form:

$$P_i = \sum_k \chi_{ik} E_k \qquad (i = 1, 2, 3) \qquad \text{(X.5)}$$

It follows from the above that to characterize an anisotropic
dielectric, we must set nine quantities $\chi_{ik}$ (a single quantity $\chi$
was sufficient for an isotropic dielectric).

Let us now go over from the previous coordinate system $x_1, x_2, x_3$
(the system $K$) to a new system $x'_1, x'_2, x'_3$ (the system $K'$) whose axes
also do not coincide with the principal directions of the dielectric.
We shall find out how the quantities $\chi_{ik}$ transform in such a
transition. In the new system of coordinates, the equations relating $P'_i$
and $E'_k$ are similar to equations (X.5):

$$P'_i = \sum_k \chi'_{ik} E'_k \qquad \text{(X.6)}$$

Here $\chi'_{ik}$ are nine quantities characterizing the dielectric in the
new coordinate system.

By formulas (VI.37) and (VI.38), the components of the vector P
in the transition from the system $K$ to the system $K'$ are transformed
by the formula

$$P'_i = \sum_l \alpha_{il} P_l \qquad \text{(X.7)}$$

and the components of the vector E in the transition from the system
$K'$ to the system $K$ are transformed by the formula

$$E_m = \sum_k \alpha_{km} E'_k \qquad \text{(X.8)}$$

(Recall that $\alpha_{ik} = \mathbf{e}'_i \mathbf{e}_k$ is the cosine of the angle between the $i$-th
primed and the $k$-th unprimed coordinate axes.)

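Let us express $P_l$ in terms of $E_m$ in (X.7) according to relation (X.5).
The result is

$$P'_i = \sum_l \alpha_{il} P_l = \sum_l \alpha_{il} \sum_m \chi_{lm} E_m = \sum_{l,m} \alpha_{il} \chi_{lm} E_m$$

Now let us substitute into this equation $E_m$ from formula (X.8):

$$P'_i = \sum_{l,m} \alpha_{il} \chi_{lm} \sum_k \alpha_{km} E'_k = \sum_k E'_k \sum_{l,m} \alpha_{il} \alpha_{km} \chi_{lm}$$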
Comparing the expression obtained with expression (X.6), we
find that

$$\chi'_{ik} = \sum_{l,m} \alpha_{il} \alpha_{km} \chi_{lm} \qquad \text{(X.9)}$$

The set of the nine quantities $T_{ik}$ that transform in the transition
from the coordinate system $K$ to the system $K'$ by the formula

$$T'_{ik} = \sum_{l,m} \alpha_{il} \alpha_{km} T_{lm} \qquad \text{(X.10)}$$

is known as a tensor of the second rank (or a tensor of the second
order).

The reverse transformation (from the system $K'$ to the system $K$)
is performed by the formula

$$T_{ik} = \sum_{l,m} \alpha_{li} \alpha_{mk} T'_{lm} \qquad \text{(X.11)}$$
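
The transformation law can be tried out on a concrete rotation. The sketch below is an added illustration under stated assumptions: the system $K'$ is taken to be $K$ turned through an angle `theta` about the third axis, so that the coefficients $\alpha_{ik}$ are the direction cosines of that rotation; the susceptibility values, the field and all function names are hypothetical.

```python
import math

def rotation_about_z(theta):
    """alpha_ik = cosine of the angle between the i-th primed and k-th unprimed axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def transform_vector(alpha, v):
    """v'_i = sum over l of alpha_il v_l."""
    return [sum(alpha[i][l] * v[l] for l in range(3)) for i in range(3)]

def transform_tensor(alpha, T):
    """T'_ik = sum over l, m of alpha_il alpha_km T_lm."""
    return [[sum(alpha[i][l] * alpha[k][m] * T[l][m]
                 for l in range(3) for m in range(3))
             for k in range(3)] for i in range(3)]

def inverse_transform_tensor(alpha, Tp):
    """T_ik = sum over l, m of alpha_li alpha_mk T'_lm."""
    return [[sum(alpha[l][i] * alpha[m][k] * Tp[l][m]
                 for l in range(3) for m in range(3))
             for k in range(3)] for i in range(3)]

def apply_tensor(T, v):
    """P_i = sum over k of T_ik v_k."""
    return [sum(T[i][k] * v[k] for k in range(3)) for i in range(3)]

chi = [[2.0, 0.5, 0.0], [0.5, 1.0, 0.0], [0.0, 0.0, 3.0]]   # assumed values
E = [1.0, 2.0, 3.0]                                          # assumed values
alpha = rotation_about_z(0.3)

P_prime_direct = transform_vector(alpha, apply_tensor(chi, E))
chi_prime = transform_tensor(alpha, chi)
P_prime_via_tensor = apply_tensor(chi_prime, transform_vector(alpha, E))
chi_back = inverse_transform_tensor(alpha, chi_prime)

print(P_prime_direct)
print(P_prime_via_tensor)   # agrees with the line above up to rounding
```

Transforming P as a vector and computing it from the transformed tensor and the transformed field give the same components, and the reverse formula restores the original tensor.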

A tensor is written in one of the following three ways:

$$T = (T_{ik}) = \begin{pmatrix} T_{11} & T_{12} & T_{13} \\ T_{21} & T_{22} & T_{23} \\ T_{31} & T_{32} & T_{33} \end{pmatrix} \qquad \text{(X.12)}$$

The quantities $T_{ik}$ are called the components of a tensor. The
components $T_{11}$, $T_{22}$, and $T_{33}$ are said to be diagonal.

Hence, the properties of an anisotropic dielectric are described by 
the dielectric susceptibility tensor 


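$$(\chi_{ik}) = \begin{pmatrix} \chi_{11} & \chi_{12} & \chi_{13} \\ \chi_{21} & \chi_{22} & \chi_{23} \\ \chi_{31} & \chi_{32} & \chi_{33} \end{pmatrix} \qquad \text{(X.13)}$$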

Of special interest is the case when the coordinate axes coincide
with the principal directions of a dielectric. Now the field component
$E_i$ sets up only the $i$-th component of the polarization, and

$$P_i = \chi_{ii} E_i \qquad (i = 1, 2, 3)$$

A comparison with (X.5) leads us to the conclusion that only the
diagonal tensor components are non-zero in the present case so that
the dielectric susceptibility tensor is

$$(\chi_{ik}) = \begin{pmatrix} \chi_1 & 0 & 0 \\ 0 & \chi_2 & 0 \\ 0 & 0 & \chi_3 \end{pmatrix} \qquad \text{(X.14)}$$

(we have left only one subscript on the non-zero tensor components
because both subscripts are the same for diagonal components).
A tensor in which only the diagonal components are non-zero¹
is said to be reduced to the principal axes. The values of the diagonal
components obtained in this case are called the principal values of
the tensor.

We must note that in an isotropic dielectric all three principal
values of the dielectric susceptibility tensor are the same:
$\chi_1 = \chi_2 = \chi_3 = \chi$. For an isotropic dielectric, any three mutually
perpendicular directions can be taken as its principal directions.

Tensors not only of the second rank, but also of other ranks are
considered. For instance, a tensor of the third rank is a collection of
27 quantities $T_{ikl}$ that transform in a transition from one coordinate
system to another by the formula

$$T'_{ikl} = \sum_{m,p,s} \alpha_{im} \alpha_{kp} \alpha_{ls} T_{mps} \qquad \text{(X.15)}$$

Tensors of other ranks are determined similarly. A tensor of rank
$r$ has $3^r$ components. It is a simple matter to see that a vector is a
tensor of the first rank (it has $3^1 = 3$ components), and a scalar is
a tensor of the zeroth rank (it has $3^0 = 1$ component).

The concept of a tensor can readily be extended to an $n$-dimensional
space. A tensor of rank $r$ in such a space (an $n$-tensor of rank $r$) is
defined to be a set of $n^r$ quantities $T_{ik\ldots p}$ (altogether $r$ subscripts)
that transform according to a formula differing from formula (X.15)
only in that the dummy indices take on $n$ values instead of three
in summation.

We shall consider some more examples of tensors of rank two.
Let us take two vectors a and b and form products of the kind

$$\Pi_{ik} = a_i b_k \qquad \text{(X.16)}$$

from their components. It is not difficult to see that these products
are transformed by formula (X.10), i.e. have the properties of the
components of a tensor of the second rank.

The tensor

$$(\delta_{ik}) = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} \qquad \text{(X.17)}$$

is called a unit tensor. By transformation formula (X.10), its
components in the new coordinate system are

$$\delta'_{ik} = \sum_{l,m} \alpha_{il} \alpha_{km} \delta_{lm} = \sum_l \alpha_{il} \alpha_{kl} = \delta_{ik}$$

[we have taken advantage of the property of the coefficients $\alpha_{ik}$
expressed by formula (VI.39)]. The components of a unit tensor
are thus identical in all coordinate systems. Tensors having this
property are said to be invariant.

1 We must note that this is possible only with specially chosen coordinate
axes.

2. Tensor Algebra. Let us consider the fundamental operations 
with tensors. 

The sum of the tensors $T_{ik}$ and $G_{ik}$ is defined to be a tensor with
the components

$$\Sigma_{ik} = T_{ik} + G_{ik} \qquad \text{(X.18)}$$

In accordance with (X.18), any tensor can be written as the sum of
two (or more) tensors.

The product of the tensor $T_{ik}$ and the scalar $a$ is defined to be the
tensor $G_{ik}$ with the components

$$G_{ik} = aT_{ik} \qquad \text{(X.19)}$$

The product of the tensors $T_{ik}$ and $G_{lm}$ is defined to be the tensor
$\Pi_{iklm}$ of the fourth rank with the components

$$\Pi_{iklm} = T_{ik} G_{lm} \qquad \text{(X.20)}$$

The product of tensors of other ranks is determined similarly.
Particularly, we have considered [see (X.16)] the product of tensors of
rank one, i.e. vectors. Examination of definition (X.20) shows that
the rank of a product tensor equals the sum of the ranks of the
multiplier tensors (tensors of different ranks can also be multiplied).

By the contraction of a tensor is meant the following operation:
two subscripts on the tensor components are assumed to be the same
and summation is performed over them. The quantity obtained as
a result of such an operation is called the contraction of the tensor.
Contraction obviously lowers the rank of a tensor by two units.
The contraction operation can be used for tensors of a rank not lower
than the second one¹. For a tensor of the second rank, its contraction
is a tensor of the zeroth rank, i.e. a scalar. The latter is called the
trace of the tensor (compare with the trace of a matrix, p. 334). It
equals the sum of the diagonal components:

$$\operatorname{Tr}(T_{ik}) = \sum_i T_{ii} \qquad \text{(X.21)}$$

A scalar does not change upon the transformation of coordinates. 
Consequently, the trace of a tensor is an invariant. For example, the 
trace of the tensor (X.16) is the scalar product of the vectors a and b 
which, as we know, is invariant relative to the transformation of the 
coordinates [see the text following formula (VI. 28)]. 
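
The invariance of the trace can be checked on the tensor (X.16). In the sketch below, added as an illustration, the new coordinate system is assumed to be obtained by a rotation about the third axis; the vectors, the angle and the function names are hypothetical.

```python
import math

def rotation_about_z(theta):
    """Direction cosines alpha_ik for a rotation of the axes about the third axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]

def transform_vector(alpha, v):
    return [sum(alpha[i][l] * v[l] for l in range(3)) for i in range(3)]

def outer(a, b):
    """Tensor with components a_i b_k."""
    return [[a[i] * b[k] for k in range(3)] for i in range(3)]

def trace(T):
    """Contraction of a second-rank tensor: sum of the diagonal components."""
    return sum(T[i][i] for i in range(3))

a = [1.0, 2.0, 3.0]      # assumed values
b = [4.0, -5.0, 6.0]     # assumed values
alpha = rotation_about_z(0.7)

trace_K = trace(outer(a, b))
trace_K_prime = trace(outer(transform_vector(alpha, a), transform_vector(alpha, b)))

print(trace_K)          # 12.0, the scalar product of a and b
print(trace_K_prime)    # the same value up to rounding
```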

In physical applications, the multiplication of tensors is custo- 
marily used in combination with the subsequent contraction of the 


1 For a tensor whose rank $r$ is higher than the second, contraction can be
performed in several ways (over different pairs of subscripts). The result will
be different tensors of the rank $r - 2$.
expression obtained. The result of these operations is known as the
scalar product of tensors¹. A typical example is the scalar product
of vectors (tensors of the first rank) which we have just mentioned:
the tensor $a_i b_k$ is contracted over the pair of subscripts $i$ and $k$,
and the result is the expression $\sum_i a_i b_i$.

In accordance with the above, by a scalar product of the tensor
$T_{ik}$ and the vector $a_k$ is meant the vector $b_i$ with the components

$$b_i = \sum_k T_{ik} a_k \qquad \text{(X.22)}$$

Let us convince ourselves that the set of quantities $b_i$ determined
in this way does form a vector. To this end, let us find the law of
transformation of the quantities $b_i$. It is evident that

$$b'_i = \sum_k T'_{ik} a'_k$$

Let us introduce into the last expression the values of $T'_{ik}$ and $a'_k$
in terms of the unprimed components:

$$b'_i = \sum_k \sum_{l,m} \alpha_{il} \alpha_{km} T_{lm} \sum_s \alpha_{ks} a_s = \sum_l \alpha_{il} \sum_{m,s} T_{lm} a_s \sum_k \alpha_{km} \alpha_{ks}$$

According to the property (VI.40), $\sum_k \alpha_{km} \alpha_{ks} = \delta_{ms}$. Consequently,

$$b'_i = \sum_l \alpha_{il} \sum_{m,s} T_{lm} a_s \delta_{ms} = \sum_l \alpha_{il} \sum_m T_{lm} a_m = \sum_l \alpha_{il} b_l$$

The result we have obtained signifies that the quantities $b_i$ are
transformed according to the law of transformation of vector
components. Hence, the quantities $b_i$ determined by expression (X.22) do
indeed form a vector.

In a similar way, we can convince ourselves of the correctness of
the following statement: if a set of nine quantities $X_{ik}$ taken with
the components of a vector $a_k$ in the combination

$$\sum_k X_{ik} a_k$$

gives the components of another vector $b_i$, the quantities $X_{ik}$ are
the components of a tensor. We encountered such a situation at the
beginning of this Appendix. The set of nine quantities $\chi_{ik}$ taken
with the components of the vector $E_k$ in the combination (X.5) gave
the components of the vector $P_i$. On these premises, we showed that
the quantities $\chi_{ik}$ are transformed according to the law of
transformation of tensor components, i.e. that the dielectric susceptibility
is a tensor.


1 The latter product is sometimes called an inner one, whereas expression
(X.20) is called an outer product of tensors.
As a result of the scalar multiplication of the vector a and the
tensor T, we obtain the new vector b:

$$\mathbf{b} = T\mathbf{a} \qquad \text{(X.23)}$$

The tensor T can therefore be considered as a linear operator
transforming one vector into another one.

Let us find the product of a unit tensor and the vector $a_k$. By
formula (X.22), we have

$$\sum_k \delta_{ik} a_k = a_i$$

Hence, a vector does not change when multiplied by a unit tensor.

3. Symmetric and Antisymmetric Tensors. The tensor $S_{ik}$ whose
components satisfy the condition

$$S_{ik} = S_{ki} \qquad \text{(X.24)}$$

is said to be symmetric. We must note that the dielectric
susceptibility tensor treated above is symmetric.

The tensor $A_{ik}$ whose components satisfy the condition

$$A_{ik} = -A_{ki} \qquad \text{(X.25)}$$

is said to be antisymmetric.

The property of symmetry or antisymmetry belongs to the tensor
itself. This follows from the fact that this property is retained in
any transformations of the coordinates. We shall give the following
calculations to prove this statement:

$$S'_{ik} = \sum_{l,m} \alpha_{il} \alpha_{km} S_{lm} = \sum_{m,l} \alpha_{im} \alpha_{kl} S_{ml} = \sum_{l,m} \alpha_{kl} \alpha_{im} S_{lm} = S'_{ki}$$

(we first interchanged the dummy indices $l$ and $m$, and then replaced
$S_{ml}$ with $S_{lm}$ equal to it). We have proved that a tensor symmetric
in the system $K$ will be symmetric in any other system $K'$.
Similarly

$$A'_{ik} = \sum_{l,m} \alpha_{il} \alpha_{km} A_{lm} = \sum_{m,l} \alpha_{im} \alpha_{kl} A_{ml} = -\sum_{l,m} \alpha_{kl} \alpha_{im} A_{lm} = -A'_{ki}$$

(after interchanging the dummy indices, we replaced $A_{ml}$ with
$-A_{lm}$ equal to it). We have proved that a tensor antisymmetric in
$K$ will be antisymmetric in $K'$ too.

Any tensor $T_{ik}$ can be written as the sum of a symmetric and an
antisymmetric tensor. Indeed, let us write $T_{ik}$ as

$$T_{ik} = \frac{T_{ik} + T_{ki}}{2} + \frac{T_{ik} - T_{ki}}{2}$$

The lawfulness of such an expression is obvious. At the same time,
the first term does not change when the subscripts $i$ and $k$ are
interchanged, i.e. it has the properties of the components of a symmetric
tensor; the second term, on the other hand, changes its sign when
the subscripts are interchanged, i.e. has the properties of the
components of an antisymmetric tensor.

We can therefore always consider that

$$T_{ik} = S_{ik} + A_{ik} \qquad \text{(X.26)}$$

where

$$S_{ik} = \frac{T_{ik} + T_{ki}}{2}, \qquad A_{ik} = \frac{T_{ik} - T_{ki}}{2} \qquad \text{(X.27)}$$

The concepts of symmetry and antisymmetry can also be applied
to tensors of a higher rank. For instance, $T_{ikl}$ is said to be symmetric
(antisymmetric) relative to the subscripts $i$ and $k$ (or $i$ and $l$, or
$k$ and $l$) if upon transposition of these subscripts the components of
the tensor do not change (reverse their sign). If the components of
a tensor do not change (or reverse their sign) upon the transposition
of any pair of subscripts, the tensor is said to be absolutely symmetric
(absolutely antisymmetric).

The set of 27 quantities $\varepsilon_{ikl}$ introduced in Appendix VI forms
an absolutely antisymmetric tensor of the third rank. Recall that
the quantities $\varepsilon_{ikl}$ (1) are zero if any two subscripts have identical
values; (2) equal $+1$ if all the subscripts are different and form a
cyclic transposition of the sequence 1, 2, 3; (3) equal $-1$ if all the
subscripts are different and form a cyclic transposition of the sequence
3, 2, 1. Of the 27 components of this tensor, 21 are zero, three are $+1$,
and three are $-1$. It can be shown by simple but cumbersome
calculations that

$$\varepsilon'_{ikl} = \sum_{m,p,s} \alpha_{im} \alpha_{kp} \alpha_{ls} \varepsilon_{mps} = \varepsilon_{ikl} \qquad \text{(X.28)}$$

i.e. that the tensor $\varepsilon_{ikl}$ is invariant.
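
The count of zero, $+1$ and $-1$ components can be reproduced by direct enumeration. In the sketch below, added as an illustration, the components are generated from the closed-form expression $(i-k)(k-l)(l-i)/2$, which is a convenient device assumed here and not taken from the text; the function name is hypothetical.

```python
def levi_civita(i, k, l):
    """Component with subscripts i, k, l, each taking the values 1, 2, 3."""
    # The product is 0 when two subscripts coincide and +2 or -2 otherwise.
    return (i - k) * (k - l) * (l - i) // 2

values = [levi_civita(i, k, l)
          for i in (1, 2, 3) for k in (1, 2, 3) for l in (1, 2, 3)]

print(values.count(0), values.count(1), values.count(-1))   # 21 3 3
print(levi_civita(1, 2, 3), levi_civita(2, 3, 1), levi_civita(3, 2, 1))   # 1 1 -1
```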

4. True Tensors and Pseudotensors. Let us see how the components
of a tensor transform upon the inversion of the coordinate axes. The
transformation coefficients in this case have the values

$$\alpha_{ik} = -\delta_{ik}$$

[see (VI.42)]. Therefore, the formula for the transformation of the
components of a tensor of rank $r$, $T_{ik\ldots l}$ (altogether $r$ subscripts), in
inversion is

$$T'_{ik\ldots l} = \sum_{m,p,\ldots,s} (-\delta_{im})(-\delta_{kp})\cdots(-\delta_{ls})\, T_{mp\ldots s} = (-1)^r \sum_{m,p,\ldots,s} \delta_{im}\delta_{kp}\cdots\delta_{ls}\, T_{mp\ldots s} = (-1)^r\, T_{ik\ldots l}$$

Hence, the components of a tensor of rank $r$ (we have in mind
tensors in three-dimensional space) are transformed in inversion
by the formula

$$T'_{ik\ldots l} = (-1)^r\, T_{ik\ldots l}$$
By (X.28), the quantities $\varepsilon_{ikl}$ in any transformation of the
coordinates (and, consequently, in inversion) remain invariant, i.e.
do not change their sign. The components of a true tensor of rank
three, on the other hand, must reverse their sign in inversion. This
is why the set of quantities $\varepsilon_{ikl}$ forms a pseudotensor instead of
a true one.

A pseudotensor of rank $r$ is defined to be a set of $3^r$ components
$P_{ik\ldots l}$ which upon rotations of the coordinate system behave like
the components of an ordinary tensor, and upon inversion are
transformed by the formula

$$P'_{ik\ldots l} = (-1)^{r+1}\, P_{ik\ldots l}$$

Let us consider examples of pseudotensors of different ranks. We
acquainted ourselves with pseudoscalars and pseudovectors in
Appendix VI. We treated a pseudotensor of the third rank $\varepsilon_{ikl}$
above. The set of quantities $P_{ik}$ formed from the products of the
components $a_i$ of the true vector and $p_k$ of the pseudovector, i.e.

$$P_{ik} = a_i p_k$$

is a pseudotensor of the second rank. Indeed, upon inversion, the
quantities $a_i$ change their sign, while the quantities $p_k$ remain
unchanged. Therefore, $P_{ik}$ in inversion will change its sign. At the
same time, we know that the components of a true tensor of the
second rank do not change their sign in inversion. Hence, $P_{ik}$ is
a pseudotensor.

It is a simple matter to see that the contraction of a pseudotensor
also results in a pseudotensor [the evenness or oddness of a tensor's
rank in contraction does not change, while the formulas for the
transformation of the components of the initial tensor and its
contraction differ by the factor $(-1)^2$]. For instance, in contracting the
pseudotensor $P_{ik}$, we obtain the scalar product of a true vector and
a pseudovector, i.e. a pseudoscalar.

The product of a pseudotensor $P_{ik\ldots l}$ of any rank and a true tensor
$T_{mp\ldots s}$ of any rank is a pseudotensor.

To verify this statement, let us compile the following table:

Table X.1

Pseudotensor $P_{ik\ldots l}$      True tensor $T_{mp\ldots s}$       Product
rank   sign upon inversion     rank   sign upon inversion     rank   sign upon inversion
even   changes                 even   does not change         even   changes
even   changes                 odd    changes                 odd    does not change
odd    does not change         even   does not change         odd    does not change
odd    does not change         odd    changes                 even   changes
Similarly, we can see that the product of two pseudotensors of
any ranks is a true tensor. 

5. Properties of a Symmetric Tensor of the Second Rank. Of the
nine components of a symmetric tensor of the second rank, only
six are independent ($S_{12} = S_{21}$, $S_{13} = S_{31}$, $S_{23} = S_{32}$).

A symmetric tensor of the second rank allows an important
geometric interpretation. Before considering it, we shall note that
the vector a can be represented not only by a directed segment
of a straight line, but also by a plane whose equation is

$$\mathbf{a}\mathbf{r} = 1 \qquad \text{(X.29)}$$

where r is a position vector of a point on the plane (Fig. X.2). Since
$\mathbf{a}\mathbf{r} = a r_a = 1$, we have $r_a = 1/a$. Consequently, Eq. (X.29) defines
a plane perpendicular to the vector a and at a distance $1/a$ from the
origin of coordinates. Expression (X.29) can also be written as
$a_r r = 1$, whence $r = 1/a_r$. Hence, on the straight line passing through
the origin of coordinates and having the direction n (see Fig. X.2),
the plane (X.29) intercepts a segment of the length

$$\rho = \frac{1}{a_n} \qquad \text{(X.30)}$$

Fig. X.2.

Now let us turn to the symmetric tensor S. We shall correlate
with it a surface determined by the equation

$$\mathbf{r}(S\mathbf{r}) = 1 \qquad \text{(X.31)}$$

It is obvious that this surface does not depend on the choice of the
coordinate system in which the components of the tensor S and of
the position vector r are determined. The nature of the surface
depends only on the properties of the tensor S.

Let us choose an arbitrary coordinate system $x_1, x_2, x_3$ and express
the left-hand side of Eq. (X.31) in terms of the components of r and S
in this system. By (X.22), we have

$$(S\mathbf{r})_i = \sum_k S_{ik} x_k$$

Scalar multiplication of r and the vector $S\mathbf{r}$ yields

$$\sum_i x_i (S\mathbf{r})_i = \sum_i x_i \sum_k S_{ik} x_k = \sum_{i,k} S_{ik} x_i x_k$$
Consequently, Eq. (X.31) has the form

$$\sum_{i,k} S_{ik} x_i x_k = 1 \qquad \text{(X.32)}$$

In the expanded form, this equation appears as follows (recall that
$S_{ik} = S_{ki}$):

$$S_{11}x_1^2 + S_{22}x_2^2 + S_{33}x_3^2 + 2S_{12}x_1x_2 + 2S_{23}x_2x_3 + 2S_{31}x_3x_1 = 1 \qquad \text{(X.33)}$$


Equation (X.33) determines a second-degree surface whose centre 
is at the origin of coordinates. In the applications of tensor calculus 
to physical problems, the diagonal components Sn may be greater 
than zero (this happens, for example, with the quantities %a)- In 

this case, the surface (X.33) is an ellipsoid. It is exactly this ellipsoid that is
the geometrical image of a second-rank symmetric tensor (naturally, in
three-dimensional space), just as a directed line segment or the plane
(X.29) is a geometrical image of a first-rank tensor, i.e. a vector.

Let us find the distance from the centre of the ellipsoid to points on
its surface. To do this, we draw an arbitrary straight line $\mathbf{n}$ (Fig. X.3)
from the centre of the ellipsoid and assume it to be the $x_1$-axis.
Now the distance $p$ from the centre of the ellipsoid to the point
$P$ on its surface will equal the value of $x_1$ when $x_2 = x_3 = 0$. Assuming in
Eq. (X.33) that $x_2 = x_3 = 0$, we find that $S_{11} x_1^2 = 1$, whence

$$p = \frac{1}{\sqrt{S_{11}}}$$

[compare with (X.30)]. Hence, the distance $p$ is the reciprocal of the
square root of the tensor component $S_{11}$ evaluated for the condition
that the direction along which $p$ is measured has been taken as the
$x_1$-axis.

Fig. X.3.
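The rule just stated can be checked numerically. The sketch below is only an illustration under stated assumptions: the tensor `S` and the direction `n` are hypothetical values, and the helper names are not taken from the text. It uses the fact that the component $S_{11}$ in a system whose $x_1$-axis lies along the unit vector $\mathbf{n}$ equals $\mathbf{n}(S\mathbf{n})$, and then confirms that the point at the distance $p$ along $\mathbf{n}$ satisfies Eq. (X.31).

```python
import math

def tensor_apply(S, v):
    """Return the vector Sv with components sum_k S[i][k] * v[k] (formula X.22)."""
    return [sum(S[i][k] * v[k] for k in range(3)) for i in range(3)]

def dot(u, v):
    """Scalar product of two three-component vectors."""
    return sum(ui * vi for ui, vi in zip(u, v))

def unit(n):
    """Unit vector along n."""
    norm = math.sqrt(dot(n, n))
    return [c / norm for c in n]

def ellipsoid_distance(S, n):
    """Distance from the centre to the surface r(Sr) = 1 along the direction n."""
    e = unit(n)
    s_11 = dot(e, tensor_apply(S, e))   # S_11 when n is taken as the x_1-axis
    return 1.0 / math.sqrt(s_11)

# Hypothetical symmetric tensor with positive principal values.
S = [[4.0, 1.0, 0.0],
     [1.0, 3.0, 0.0],
     [0.0, 0.0, 2.0]]

n = [1.0, 1.0, 0.0]
p = ellipsoid_distance(S, n)
point = [p * c for c in unit(n)]
# The second printed number should be close to 1, i.e. the point lies on the surface.
print(p, dot(point, tensor_apply(S, point)))
```

Choosing `n` along a coordinate axis reduces the function to the reciprocal square root of the corresponding diagonal component.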

In transformations of the coordinates, the coefficients $S_{ik}$ in
Eq. (X.33) change, but the ellipsoid itself does not depend on the
choice of the coordinate system. If we direct the coordinate axes
along the semiaxes of the ellipsoid, the equation of the latter, i.e.
Eq. (X.33), as is known, becomes simpler and acquires the form

$$S_{11} x_1^2 + S_{22} x_2^2 + S_{33} x_3^2 = 1$$

This signifies that with this choice of the coordinate axes, the non- 
diagonal tensor components vanish. Consequently, the principal axes 
of the tensor coincide with the semiaxes of the tensor ellipsoid. 
If the coordinate axes are directed along the principal axes of the
tensor (i.e. along the semiaxes of the tensor ellipsoid), the tensor
acquires a diagonal form:

$$(S_{ik}) = \begin{pmatrix} \lambda_1 & 0 & 0 \\ 0 & \lambda_2 & 0 \\ 0 & 0 & \lambda_3 \end{pmatrix} \tag{X.34}$$

We have introduced the notation $S_{11} = \lambda_1$, $S_{22} = \lambda_2$, $S_{33} = \lambda_3$.

The quantities $\lambda_1$, $\lambda_2$, and $\lambda_3$ are the principal values of the tensor.

When $\lambda_1 = \lambda_2 = \lambda_3 = \lambda$, the tensor ellipsoid transforms into a
sphere. Particularly, a sphere of unit radius corresponds to the unit
tensor $\delta_{ik}$.

Let us multiply a vector $\mathbf{a}$ by the tensor (X.34). The result is the
vector $\mathbf{b}$ whose components are determined by formula (X.22). The
components of the tensor (X.34) can be written as $S_{ik} = \lambda_i \delta_{ik}$.
Consequently,

$$b_i = \sum_k \lambda_i \delta_{ik} a_k = \lambda_i a_i \tag{X.35}$$

Let the vector $\mathbf{a}$ be directed along the first principal axis of the tensor.
Then $a_1 = a$, $a_2 = a_3 = 0$ [having taken the tensor in the form
of (X.34), we assumed that the coordinate axes are directed along the
principal axes of the tensor]. By formula (X.35), the components of
the vector $\mathbf{b}$ will be $b_1 = \lambda_1 a$, $b_2 = b_3 = 0$. This signifies that the
direction of the vector $\mathbf{b}$ coincides with that of the vector $\mathbf{a}$, while
the magnitude of the vector $\mathbf{b}$ is $\lambda_1$ times that of the vector $\mathbf{a}$. The
same also holds for the other two principal axes. Therefore, if the
vector $\mathbf{a}$ is directed along one of the principal axes of the tensor, the
following equation holds:

$$S\mathbf{a} = \lambda\mathbf{a} \tag{X.36}$$

where $\lambda$ is the relevant principal value of the tensor.

We have arrived at the following result: when a vector having the 
direction of one of the principal axes is multiplied by a tensor, the 
direction of the vector does not change, but its magnitude grows by 
a number of times equal to the relevant principal value of the tensor. 

Equation (X.36) holds for any coordinate system if only the vector 
a is directed along one of the principal axes of the tensor. This cir- 
cumstance can be used to find the principal values and the principal 
axes of a tensor. For this purpose, we must vary the direction of the 
vector a until the vector Sa coincides in direction with a. The direc- 
tion found gives a principal axis of the tensor, and the ratio of Sa 
to a — the corresponding principal value. Analytically, this can be 
expressed as follows. Let us write Eq. (X.36) in components (we have 
not yet reduced the tensor to its principal axes so that, generally 
speaking, all the $S_{ik}$'s are non-zero):

$$\begin{aligned}
S_{11} a_1 + S_{12} a_2 + S_{13} a_3 &= \lambda a_1 \\
S_{21} a_1 + S_{22} a_2 + S_{23} a_3 &= \lambda a_2 \\
S_{31} a_1 + S_{32} a_2 + S_{33} a_3 &= \lambda a_3
\end{aligned}$$

Combining like terms, we obtain

$$\begin{aligned}
(S_{11} - \lambda) a_1 + S_{12} a_2 + S_{13} a_3 &= 0 \\
S_{21} a_1 + (S_{22} - \lambda) a_2 + S_{23} a_3 &= 0 \\
S_{31} a_1 + S_{32} a_2 + (S_{33} - \lambda) a_3 &= 0
\end{aligned} \tag{X.37}$$

We have arrived at a homogeneous system of three equations with
the unknowns $a_1$, $a_2$, $a_3$, the components of the required vector $\mathbf{a}$.
For this system to have a non-zero solution, it is necessary that its
determinant be zero [see Appendix VIII, the text following formula
(VIII.26)]. Hence, the following condition must be satisfied:

$$\begin{vmatrix} S_{11} - \lambda & S_{12} & S_{13} \\ S_{21} & S_{22} - \lambda & S_{23} \\ S_{31} & S_{32} & S_{33} - \lambda \end{vmatrix} = 0 \tag{X.38}$$

The roots of this cubic equation in $\lambda$ are the principal values of the
tensor: $\lambda_1$, $\lambda_2$, $\lambda_3$. Introducing one of these roots into the system
(X.37), we can find the ratios $a_2/a_1$ and $a_3/a_1$ that will determine the
direction of the vector $\mathbf{a}$ satisfying the equation (X.36), i.e. the
relevant principal axis of the tensor. By performing this operation for all
three $\lambda_i$'s, we find the directions of all the principal axes.
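The procedure can be traced on a small numerical case. The following is a minimal sketch, not the method of the text: the tensor `S` is a hypothetical example, the roots of the cubic (X.38) are obtained with the known trigonometric closed form for symmetric 3×3 matrices (an assumed substitute for solving the cubic by hand), and each principal axis is found from the system (X.37) by noting that $\mathbf{a}$ must be perpendicular to every row of the matrix $S_{ik} - \lambda\delta_{ik}$. The axis helper assumes that the root is a simple one.

```python
import math

def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def cross(u, v):
    """Vector product of two three-component vectors."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def shifted(S, lam):
    """Matrix of the system (X.37): S_ik - lam * delta_ik."""
    return [[S[i][k] - (lam if i == k else 0.0) for k in range(3)] for i in range(3)]

def principal_values(S):
    """Roots of det(S - lam*I) = 0 for a symmetric 3x3 tensor, largest first."""
    q = (S[0][0] + S[1][1] + S[2][2]) / 3.0
    p2 = (sum((S[i][i] - q) ** 2 for i in range(3))
          + 2.0 * (S[0][1] ** 2 + S[0][2] ** 2 + S[1][2] ** 2))
    if p2 == 0.0:                          # S is a multiple of the unit tensor
        return [q, q, q]
    p = math.sqrt(p2 / 6.0)
    B = [[c / p for c in row] for row in shifted(S, q)]
    r = max(-1.0, min(1.0, det3(B) / 2.0))  # guard against rounding outside [-1, 1]
    phi = math.acos(r) / 3.0
    lam1 = q + 2.0 * p * math.cos(phi)
    lam3 = q + 2.0 * p * math.cos(phi + 2.0 * math.pi / 3.0)
    return [lam1, 3.0 * q - lam1 - lam3, lam3]

def principal_axis(S, lam):
    """Unit vector a solving (S - lam*I) a = 0, assuming lam is a simple root."""
    M = shifted(S, lam)
    # a is perpendicular to every row of M, hence parallel to a cross product of two rows.
    candidates = [cross(M[0], M[1]), cross(M[0], M[2]), cross(M[1], M[2])]
    a = max(candidates, key=lambda v: sum(c * c for c in v))
    norm = math.sqrt(sum(c * c for c in a))
    return [c / norm for c in a]

# Hypothetical symmetric tensor with three different principal values.
S = [[2.0, 1.0, 0.0],
     [1.0, 2.0, 0.0],
     [0.0, 0.0, 5.0]]

values = principal_values(S)
residuals = []
for lam in values:
    a = principal_axis(S, lam)
    Sa = [sum(S[i][k] * a[k] for k in range(3)) for i in range(3)]
    residuals.append(max(abs(Sa[i] - lam * a[i]) for i in range(3)))
    print(round(lam, 6), [round(c, 6) for c in a])
```

The residuals measure how well each pair satisfies Eq. (X.36); for multiple roots the cross products of the rows vanish and the axis helper is no longer applicable.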

We have assumed that all the roots $\lambda_1$, $\lambda_2$, and $\lambda_3$ are different.
When Eq. (X.38) has multiple roots, clarification is needed. With
two multiple roots ($\lambda_1 = \lambda_2 = \lambda$, $\lambda_3 \neq \lambda$), the tensor ellipsoid will
be an ellipsoid of revolution. Only the principal axis of the tensor
coinciding with the axis of symmetry of the ellipsoid will be
determined uniquely. Any two mutually perpendicular axes normal to
the axis of symmetry of the ellipsoid may be taken as the other two
principal axes. If all three roots of Eq. (X.38) are the same
($\lambda_1 = \lambda_2 = \lambda_3 = \lambda$), the tensor ellipsoid degenerates into a sphere, and
any three mutually perpendicular axes may be taken as the principal
ones.

Let us consider the properties of a symmetric tensor as an operator.
By (X.36), the action of $S$ on the vector $\mathbf{a}$ directed along one of the
principal axes of the tensor causes only the length of the vector to
change $\lambda$ times. The action of $S$ on an arbitrarily oriented vector
causes, generally speaking, a change in both the length and the
direction of the vector (see, for instance, Fig. X.1).
6. Properties of an Antisymmetric Tensor of the Second Rank.
Let us now turn to an antisymmetric tensor of the second rank.
Condition (X.25) for the diagonal components has the form
$A_{ii} = -A_{ii}$. This is possible only when $A_{ii} = 0$. Consequently, the
diagonal components of an antisymmetric tensor are zero. Of the
remaining six components, only three are independent. Hence, an
antisymmetric tensor of the second rank, like a vector, is determined
by three quantities:

$$\begin{aligned}
A_{12} &= -A_{21} = a_3 \\
A_{23} &= -A_{32} = a_1 \\
A_{31} &= -A_{13} = a_2
\end{aligned} \tag{X.39}$$

Using this notation, we can write an antisymmetric tensor as follows:

$$(A_{ik}) = \begin{pmatrix} 0 & a_3 & -a_2 \\ -a_3 & 0 & a_1 \\ a_2 & -a_1 & 0 \end{pmatrix} \tag{X.40}$$

The components of the tensor (X.40) can be written as

$$A_{ik} = \sum_l \varepsilon_{ikl} a_l = \varepsilon_{ik1} a_1 + \varepsilon_{ik2} a_2 + \varepsilon_{ik3} a_3 \tag{X.41}$$

where $\varepsilon_{ikl}$ is an absolutely antisymmetric pseudotensor [see the text
preceding formula (X.28)]. Indeed, the components $A_{ii}$ determined
by this formula are zero because $\varepsilon_{iil}$ at any $l$ equals zero. Further,

$$\begin{aligned}
A_{12} &= \varepsilon_{121} a_1 + \varepsilon_{122} a_2 + \varepsilon_{123} a_3 = 0 \cdot a_1 + 0 \cdot a_2 + 1 \cdot a_3 = a_3 \\
A_{13} &= \varepsilon_{131} a_1 + \varepsilon_{132} a_2 + \varepsilon_{133} a_3 = 0 \cdot a_1 + (-1) \cdot a_2 + 0 \cdot a_3 = -a_2
\end{aligned}$$

etc.

It was shown above that the multiplication of a true tensor by
a pseudotensor results in a pseudotensor. The product of two
pseudotensors, on the other hand, is a true tensor. The fact that the
quantities $a_l$ when multiplied by the pseudotensor $\varepsilon_{ikl}$ [see formula (X.41)]
yield the components of a true tensor of the second rank indicates that
these quantities have the properties of a pseudotensor of the first
rank, i.e. a pseudovector.

Consequently, a pseudovector with the components $a_l$ determined
by formulas (X.39) can be correlated with any antisymmetric tensor
of the second rank. And, conversely, a true antisymmetric tensor
of the second rank whose components are expressed in terms of the
components of a pseudovector by formula (X.41) can be said to
correspond to any pseudovector $a_l$.

Let us apply the operator $A$ (here $A$ is an antisymmetric tensor)
to a vector $\mathbf{b}$. By (X.22) and (X.41), we obtain the vector $\mathbf{c}$ having
the components

$$c_i = \sum_k A_{ik} b_k = \sum_{k,l} \varepsilon_{ikl} b_k a_l$$

Comparing the expression obtained with formula (VI.33), we arrive
at the conclusion that $\mathbf{c}$ can be written as the vector product¹ of the
vectors $\mathbf{b}$ and $\mathbf{a}$:

$$\mathbf{c} = A\mathbf{b} = [\mathbf{b}\mathbf{a}]$$

The vector $\mathbf{c}$ is normal both to the initial vector $\mathbf{b}$ and to the
pseudovector $\mathbf{a}$ corresponding to the tensor $A$. Hence, the action of the
tensor $A$ causes the vector to turn through the angle $\pi/2$. In addition,
generally speaking, the length of the vector changes.
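The correspondence between an antisymmetric tensor and a pseudovector can be illustrated numerically. The sketch below is an assumption-laden illustration: the components of `a` and `b` are arbitrary sample values, indices run over 0, 1, 2 instead of 1, 2, 3, and the function names are hypothetical. It builds $A_{ik}$ by formula (X.41) and compares $A\mathbf{b}$ with the vector product $[\mathbf{b}\mathbf{a}]$.

```python
def levi_civita(i, k, l):
    """Absolutely antisymmetric unit pseudotensor for indices 0, 1, 2."""
    return (i - k) * (k - l) * (l - i) // 2

def tensor_from_pseudovector(a):
    """Components A_ik = sum_l e_ikl * a_l, formula (X.41)."""
    return [[sum(levi_civita(i, k, l) * a[l] for l in range(3))
             for k in range(3)] for i in range(3)]

def tensor_apply(A, b):
    """Vector with components sum_k A_ik * b_k, formula (X.22)."""
    return [sum(A[i][k] * b[k] for k in range(3)) for i in range(3)]

def cross(u, v):
    """Vector product [uv]."""
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def dot(u, v):
    """Scalar product uv."""
    return sum(x * y for x, y in zip(u, v))

a = [1.0, 2.0, 3.0]   # sample pseudovector components a_1, a_2, a_3
b = [4.0, 5.0, 6.0]   # sample vector acted on by the tensor

A = tensor_from_pseudovector(a)
c = tensor_apply(A, b)
print(A)                    # has the layout of the matrix (X.40)
print(c, cross(b, a))       # the two vectors should coincide
print(dot(c, a), dot(c, b)) # both scalar products should vanish
```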


XI. Basic Concepts of Vector Analysis

Gradient. Let us consider a scalar field, i.e. a region of space to
each of whose points there corresponds a definite value of the scalar $\varphi$:

$$\varphi = \varphi(P) = \varphi(\mathbf{r}) = \varphi(x_1, x_2, x_3)$$

where $\mathbf{r}$ is a position vector, and $x_1$, $x_2$, $x_3$ are the Cartesian coordinates
of the point $P$.

An identical value of $\varphi$ corresponds to all the points of the surface
determined by the equation

$$\varphi(x_1, x_2, x_3) = \text{const} \tag{XI.1}$$

A surface having the form of (XI.1) is said to be a constant-$\varphi$ surface.
Such a surface can be drawn through any point of a field.

When displaced from the point $P$ over the distance $d\mathbf{r}$, the function
$\varphi$ receives the increment

$$d\varphi = \sum_i \frac{\partial\varphi}{\partial x_i}\, dx_i$$

The last expression does not depend on the choice of the coordinates
$x_i$, i.e. is an invariant. The set of quantities $dx_i$ forms the vector $d\mathbf{r}$.
We can therefore state [see the text following formula (VI.28)]
that the quantities $\partial\varphi/\partial x_i$ are the projections of a vector onto the
$x_i$-axes. This vector is called the gradient of the scalar $\varphi$ and is
designated by the symbol grad $\varphi$. Hence,

$$\operatorname{grad}\varphi = \sum_i \mathbf{e}_i \frac{\partial\varphi}{\partial x_i} \tag{XI.2}$$

or, in conventional notation,

$$\operatorname{grad}\varphi = \mathbf{e}_x \frac{\partial\varphi}{\partial x} + \mathbf{e}_y \frac{\partial\varphi}{\partial y} + \mathbf{e}_z \frac{\partial\varphi}{\partial z}$$


1 Recall that a vector product is a true vector only if one of the multipliers 
is a pseudovector. We could also conclude from this circumstance that a is not 
a true vector, but a pseudovector. 
It is a simple matter to extend the definition (XI.2) to an
$n$-dimensional space. In the latter case, the number of terms in formula
(XI.2) will be $n$ instead of three.

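We shall prove that the components of a gradient are transformed
by formulas (VI.37). Having taken two coordinate systems, $K$ and
$K'$, we can write

$$\sum_i \frac{\partial\varphi}{\partial x_i}\, dx_i = \sum_i \frac{\partial\varphi}{\partial x'_i}\, dx'_i \tag{XI.3}$$

Let us express $dx_i$ in terms of $dx'_k$ by formula (VI.38) and introduce
these expressions into (XI.3) (here $\alpha_{ik}$ denotes the transformation
coefficients of formulas (VI.37) and (VI.38)):

$$\sum_i \frac{\partial\varphi}{\partial x_i} \sum_k \alpha_{ki}\, dx'_k = \sum_i \frac{\partial\varphi}{\partial x'_i}\, dx'_i$$

We change the sequence of summation over the subscripts $i$ and $k$
on the left-hand side:

$$\sum_k dx'_k \sum_i \alpha_{ki} \frac{\partial\varphi}{\partial x_i} = \sum_i \frac{\partial\varphi}{\partial x'_i}\, dx'_i$$

The subscripts $i$ and $k$ are dummy ones. We have already noted
that a dummy index can be designated by any letter. Therefore, the
sum on the left will not change if we interchange the subscripts $i$
and $k$. The result is

$$\sum_i dx'_i \sum_k \alpha_{ik} \frac{\partial\varphi}{\partial x_k} = \sum_i \frac{\partial\varphi}{\partial x'_i}\, dx'_i$$

It follows from the relation we have obtained that

$$\frac{\partial\varphi}{\partial x'_i} = \sum_k \alpha_{ik} \frac{\partial\varphi}{\partial x_k}$$

which coincides with (VI.37). We have thus shown that the quantities
$\partial\varphi/\partial x_i$ upon transformations of the coordinates behave like the
components of a vector.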

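William Hamilton introduced the vector differential operator $\nabla$
(the del operator or the Hamiltonian operator) that is a vector with
the components $\partial/\partial x$, $\partial/\partial y$, and $\partial/\partial z$:

$$\nabla = \mathbf{e}_x \frac{\partial}{\partial x} + \mathbf{e}_y \frac{\partial}{\partial y} + \mathbf{e}_z \frac{\partial}{\partial z} = \sum_i \mathbf{e}_i \frac{\partial}{\partial x_i} \tag{XI.4}$$

The vector $\nabla$ has no meaning by itself. It acquires a meaning when
applied to a scalar or vector function. For instance, upon the
symbolic multiplication of $\nabla$ by $\varphi$, we obtain the gradient of $\varphi$:

$$\nabla\varphi = \sum_i \mathbf{e}_i \frac{\partial\varphi}{\partial x_i}$$

Consequently, $\nabla\varphi = \operatorname{grad}\varphi$. By (XI.3), the increment of $\varphi$ can be
written as the scalar product of the vector grad $\varphi$ and $d\mathbf{r}$:

$$d\varphi = \operatorname{grad}\varphi \cdot d\mathbf{r} = (\nabla\varphi)\, d\mathbf{r} \tag{XI.5}$$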


In movement over a constant-$\varphi$ surface, $d\varphi = 0$. It thus follows in
accordance with (XI.5) that the vector $\nabla\varphi$ at each point of the field
is directed along a normal to the constant-$\varphi$ surface. Let us find
the rate of change in $\varphi$ along a certain direction $l$, i.e. $d\varphi/dl$. By
(XI.5), the increment of $\varphi$ over the segment $d\mathbf{l}$ is $(\nabla\varphi)\, d\mathbf{l} = (\nabla\varphi)_l\, dl$,
where $(\nabla\varphi)_l$ is the projection of the gradient onto the direction $l$.
Therefore, $d\varphi/dl = [(\nabla\varphi)_l\, dl]/dl = (\nabla\varphi)_l$. Hence, the projection of
the gradient onto a certain direction gives the rate of change in the
function in the given direction.

We must note that the vector $\nabla\varphi$ exists at every point of the scalar
field $\varphi$. Consequently, a gradient forms a vector field, i.e. a region
of space to each point of which there corresponds a definite value
of the vector $\nabla\varphi$.
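The statement that the projection of the gradient onto a direction gives the rate of change in that direction lends itself to a quick numerical check. The sketch below assumes a hypothetical scalar field `phi`, a sample point `P`, and a unit vector `l`; derivatives are estimated by central differences, so the agreement is approximate.

```python
def phi(x, y, z):
    """Hypothetical scalar field used only for illustration."""
    return x * x * y + z

def gradient(f, P, h=1e-5):
    """Central-difference estimate of the components d(phi)/dx_i at the point P."""
    grad = []
    for i in range(3):
        plus, minus = list(P), list(P)
        plus[i] += h
        minus[i] -= h
        grad.append((f(*plus) - f(*minus)) / (2 * h))
    return grad

def directional_derivative(f, P, l, h=1e-5):
    """Rate of change of f along the unit vector l, estimated directly."""
    plus = [p + h * c for p, c in zip(P, l)]
    minus = [p - h * c for p, c in zip(P, l)]
    return (f(*plus) - f(*minus)) / (2 * h)

P = [1.0, 2.0, 3.0]
l = [1.0 / 3.0, 2.0 / 3.0, 2.0 / 3.0]   # unit vector of the chosen direction

grad = gradient(phi, P)
projection = sum(g * c for g, c in zip(grad, l))   # (grad phi)_l
direct = directional_derivative(phi, P, l)         # d(phi)/dl
print(grad, projection, direct)
```

For this field the analytic gradient at `P` is (4, 1, 1), so both numbers should be close to 8/3.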

Divergence. Assume that we are given the field of the vector $\mathbf{a}$.
The flux of the vector $\mathbf{a}$ through the surface $f$ is defined to be

$$\Phi_a = \int_f a_n\, df = \int_f \mathbf{a}\, d\mathbf{f} \tag{XI.6}$$

where $a_n$ is the projection of the vector $\mathbf{a}$ onto a positive normal
to the surface element $df$, and $d\mathbf{f}$ is the vector of the element; its
magnitude equals that of the surface element $df$, while its direction
coincides with that of a positive normal to the element. The direction
of the positive normal is determined depending on the circumstances.
For example, the outward normal is considered to be positive in
calculating the flux through a closed surface.

The name "flux" is due to the fact that for the field of a liquid's
velocity vector, the integral (XI.6) gives the flux of the liquid through
the surface $f$, i.e. the volume of the liquid flowing through $f$ in unit
time.

Let us surround the point $P$ of a field with the closed surface $f$
and evaluate the flux $\Phi_a$ through this surface. The ratio of $\Phi_a$ to $V$
will characterize the properties of the field in the vicinity of the
point $P$ averaged over the volume $V$ confined within $f$. The smaller
the linear dimensions of the volume, the closer will the average
characteristic be to the true characteristic of the field at the point $P$.
The scalar quantity

$$\operatorname{div}\mathbf{a} = \lim_{V \to P} \frac{\Phi_a}{V} = \lim_{V \to P} \frac{1}{V} \oint a_n\, df \tag{XI.7}$$

is known as the divergence of the vector field at the point $P$.
The definition (XI.7) is a most general one not depending on the
choice of the coordinate system. Let us find an expression for the
divergence in terms of the projections of $\mathbf{a}$ onto the axes of a Cartesian
coordinate system. We take a volume in the form of a rectangular
parallelepiped with faces perpendicular to the coordinate axes in the
vicinity of the point $P$ (Fig. XI.1). Let us find the flux of the vector $\mathbf{a}$
through the faces 1 and 2 perpendicular to the $x$-axis. An outward normal
to the face 1 coincides in direction with the $x$-axis. Hence, for
this face, $a_n = a_{x1}$ (the subscript 1 indicates that the value
of $a_x$ is taken at a point on the face 1). An outward normal to
the face 2 is opposite in direction to the $x$-axis. Therefore, we have
for it $a_n = -a_{x2}$ (the subscript 2 indicates that the value of $a_x$
is taken at a point on the face 2). The total flux through the faces
1 and 2 is

$$\Phi_x = \int_{(1)} a_{x1}\, df_1 - \int_{(2)} a_{x2}\, df_2 = \int_f (a_{x1} - a_{x2})\, df \tag{XI.8}$$

where $df = df_1 = df_2$ (see Fig. XI.1), $a_{x1}$ and $a_{x2}$ are taken for
points on the faces 1 and 2 with identical $y$'s and $z$'s. The integral
on the right is taken over the surface $f$ of any of the faces 1 and 2.
Let us expand $a_x$ into a series in the vicinity of the point $P$:

$$a_x = a_{xP} + \left(\frac{\partial a_x}{\partial x}\right)_P (x - x_P) + \left(\frac{\partial a_x}{\partial y}\right)_P (y - y_P) + \left(\frac{\partial a_x}{\partial z}\right)_P (z - z_P) + \varepsilon_x \tag{XI.9}$$

Here $x_P$, $y_P$, and $z_P$ are the coordinates of the point $P$, $a_{xP}$ is the
value of $a_x$ at the point $P$, $(\partial a_x/\partial x)_P$, etc. are the values of the
derivatives at the point $P$, and $\varepsilon_x$ is a quantity of a higher order of
smallness than the differences $(x - x_P)$, $(y - y_P)$, and $(z - z_P)$, i.e.
a quantity diminishing more rapidly than the linear dimensions of
the parallelepiped.

Assuming in expression (XI.9) that $x = x_1$, let us find the values
of $a_x$ at points on the face 1, i.e. $a_{x1}$; assuming that $x = x_2$, we
obtain the values of $a_{x2}$. Subtracting these values from one another,
we obtain the following expression for the opposite areas $df_1$ and $df_2$
(the values of $y$ and $z$ are the same for them):

$$a_{x1} - a_{x2} = \left(\frac{\partial a_x}{\partial x}\right)_P (x_1 - x_2) + \varepsilon'_x$$

where again $\varepsilon'_x$ is a quantity diminishing at a faster rate than the
linear dimensions of the volume.

Using the value we have found in formula (XI.8), we obtain

$$\Phi_x = \left(\frac{\partial a_x}{\partial x}\right)_P (x_1 - x_2) \int_f df + \int_f \varepsilon'_x\, df = \left(\frac{\partial a_x}{\partial x}\right)_P (x_1 - x_2) f + \int_f \varepsilon'_x\, df$$

A glance at Fig. XI.1 reveals that the product $(x_1 - x_2) f$ is the
volume $V$ of the parallelepiped. Hence,

$$\Phi_x = \left(\frac{\partial a_x}{\partial x}\right)_P V + \varepsilon''_x$$

where $\varepsilon''_x$ is a quantity of a higher order of smallness than $V$. Similar
expressions are also obtained for the fluxes through the pairs of faces
perpendicular to the axes $y$ and $z$:

$$\Phi_y = \left(\frac{\partial a_y}{\partial y}\right)_P V + \varepsilon''_y, \qquad \Phi_z = \left(\frac{\partial a_z}{\partial z}\right)_P V + \varepsilon''_z$$

Summating $\Phi_x$, $\Phi_y$, and $\Phi_z$, we obtain the total flux of the vector
$\mathbf{a}$ through the surface of the parallelepiped. By dividing this flux
by $V$ in accordance with the definition (XI.7) and performing the
limiting process¹ $V \to 0$, we arrive at the formula

$$\operatorname{div}\mathbf{a} = \frac{\partial a_x}{\partial x} + \frac{\partial a_y}{\partial y} + \frac{\partial a_z}{\partial z} \tag{XI.10}$$

(we have discarded the superfluous subscript $P$ on the derivatives).
Expression (XI.10) we have found for the divergence can be
written as

$$\operatorname{div}\mathbf{a} = \sum_i \frac{\partial a_i}{\partial x_i} \tag{XI.11}$$

In this form, the concept of the divergence can be extended to vector
fields in an $n$-dimensional space. The definition (XI.7) can also be
extended to an $n$-dimensional space. In this case, by an element of
volume we should understand $dV^* = dx_1\, dx_2\, dx_3 \ldots dx_n$. The
integral must be evaluated over an $(n - 1)$-dimensional hypersurface.
An element of the hypersurface perpendicular to the $x_k$-axis will
equal $df^*_k = dx_1\, dx_2 \ldots dx_{k-1}\, dx_{k+1} \ldots dx_n$. In a four-dimensional
space, a conventional three-dimensional volume will be the
hypersurface.

¹ In the limiting process, the quantities $\varepsilon''/V$ shrink to zero.
Expression (XI.11) can be considered as the sum of products of the
quantities $\nabla_i = \partial/\partial x_i$ and $a_i$, i.e. as the scalar product of the vectors
$\nabla$ and $\mathbf{a}$. The divergence can therefore be written as

$$\operatorname{div}\mathbf{a} = \nabla\mathbf{a} \tag{XI.12}$$

The quantity $\nabla\mathbf{a}$ exists at every point of the vector field of $\mathbf{a}$.
Consequently, the divergence forms a scalar field determined in the
same part of space as the field of the vector $\mathbf{a}$.
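The limiting definition (XI.7) and the Cartesian expression (XI.10) can be compared numerically. The following is a minimal sketch under stated assumptions: the vector field `field`, the point `P`, and the sampling density `n` are hypothetical choices, and the flux through a small cube centred at `P` is estimated by the midpoint rule on each face. The ratio of the flux to the volume should approach the analytic divergence as the edge `h` shrinks.

```python
def field(x, y, z):
    """Hypothetical vector field; analytically div a = 3*x**2 + x + y."""
    return (x ** 3, x * y, y * z)

def flux_per_volume(a, P, h, n=8):
    """Flux of a through a cube of edge h centred at P, divided by the cube volume."""
    cell = h / n                       # edge of one sampling square on a face
    flux = 0.0
    for axis in range(3):              # pair of faces perpendicular to this axis
        u, v = (axis + 1) % 3, (axis + 2) % 3
        for sign in (+1, -1):          # outward normal points along +axis or -axis
            for i in range(n):
                for j in range(n):
                    point = list(P)
                    point[axis] += sign * h / 2
                    point[u] += -h / 2 + (i + 0.5) * cell
                    point[v] += -h / 2 + (j + 0.5) * cell
                    a_n = sign * a(*point)[axis]   # projection onto the outward normal
                    flux += a_n * cell * cell
    return flux / h ** 3

P = [1.0, 2.0, 3.0]
exact = 3 * P[0] ** 2 + P[0] + P[1]    # value given by formula (XI.10)
for h in (0.1, 0.01):
    print(h, flux_per_volume(field, P, h), exact)
```

For this particular field the discrepancy equals $h^2/4$, which illustrates how the terms of a higher order of smallness drop out in the limit.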

Let us take the finite volume $V$ confined within the surface $f$ (Fig. XI.2)
in the field of the vector $\mathbf{a}$. We divide this volume into elementary
volumes $\Delta V$. By (XI.7) for the flux $\Delta\Phi_a$ through the surface of such
a volume, we can write $\Delta\Phi_a \approx \operatorname{div}\mathbf{a} \cdot \Delta V = \nabla\mathbf{a} \cdot \Delta V$.

Let us summate these expressions for all the elementary volumes. In
summating $\Delta\Phi_a$, the fluxes through the faces separating two adjacent
volumes mutually nullify each other (for adjacent volumes, the fluxes
differ in their signs because the outward normals $\mathbf{n}$ and $\mathbf{n}'$
have opposite directions). Only the fluxes through the portions of the
outer surface $f$ remain uncompensated, so that the sum gives us the
flux of the vector $\mathbf{a}$ through this surface. The sum on the right in
the limit (when $\Delta V \to 0$) transforms into an integral over the entire
volume. The approximate equality in the limit will transform into
a strict equality. As a result, we obtain

$$\oint_f \mathbf{a}\, d\mathbf{f} = \int_V \nabla\mathbf{a}\, dV \tag{XI.13}$$

The relation we have obtained is called the Ostrogradsky-Gauss
theorem. It states that the flux of a vector through a closed surface equals
the integral of the divergence over the volume confined by this surface.

Fig. XI.2.
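The theorem can be illustrated on a unit cube. In the sketch below, the field, its analytic divergence, and the number of subdivisions `n` are assumptions made for the example. The left-hand side of (XI.13) is the outward flux through the six faces, and the right-hand side is approximated by the sum of $\operatorname{div}\mathbf{a} \cdot \Delta V$ over the elementary cubes, exactly as in the summation described above.

```python
def field(x, y, z):
    """Hypothetical vector field."""
    return (x ** 3, x * y, y * z)

def divergence(x, y, z):
    """Analytic divergence of `field`: d(x^3)/dx + d(xy)/dy + d(yz)/dz."""
    return 3 * x * x + x + y

def surface_flux(n=20):
    """Outward flux of `field` through the faces of the cube [0, 1]^3 (midpoint rule)."""
    cell = 1.0 / n
    mids = [(i + 0.5) * cell for i in range(n)]
    flux = 0.0
    for axis in range(3):
        for coord, sign in ((1.0, +1), (0.0, -1)):   # face position, outward normal sign
            for u in mids:
                for v in mids:
                    point = [0.0, 0.0, 0.0]
                    point[axis] = coord
                    point[(axis + 1) % 3] = u
                    point[(axis + 2) % 3] = v
                    flux += sign * field(*point)[axis] * cell * cell
    return flux

def volume_sum(n=20):
    """Sum of div a * dV over n^3 elementary cubes filling the unit cube."""
    cell = 1.0 / n
    mids = [(i + 0.5) * cell for i in range(n)]
    return sum(divergence(x, y, z) for x in mids for y in mids for z in mids) * cell ** 3

print(surface_flux(), volume_sum(20), volume_sum(40))
```

The volume sum approaches the surface flux as the elementary volumes are made smaller, which mirrors the passage from the approximate to the strict equality.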

Curl. The circulation of the vector $\mathbf{a}$ over the contour $\Gamma$ is defined
to be the expression

$$C_a = \oint_\Gamma \mathbf{a}\, d\mathbf{l} = \oint_\Gamma a_l\, dl \tag{XI.14}$$

For example, in a potential field of forces, the circulation of the
vector $\mathbf{F}$ equals the work of the forces on the closed path $\Gamma$.

Let us take in the vicinity of the point $P$ the contour $\Gamma$ in a plane
passing through $P$. We shall find the circulation $C_a$ around this
contour. The ratio of $C_a$ to the surface $f$ enclosed by the contour will
characterize the properties of the field in the vicinity of the point $P$,
averaged over the surface $f$. The smaller the linear dimensions of the
surface, the closer will the average characteristic be to the true
characteristic of the field at the point $P$. At the limit, when the
contour shrinks to the point $P$, the average characteristic will
transform into the true one. Therefore, the properties of the vector field
at the point $P$ can be characterized by the quantity

$$\lim_{f \to 0} \frac{C_a}{f} = \lim_{f \to 0} \frac{1}{f} \oint_\Gamma \mathbf{a}\, d\mathbf{l} \tag{XI.15}$$

/-» o i ' *• 

The quantity (XI.15) depends not only on the properties of the
field at the point $P$, but also on the orientation of the plane in
which the contour is. The orientation of this plane in space can be set
by a normal $\mathbf{n}$ to the plane associated with the direction of
circumvention of the contour $\Gamma$ in integration by the right-hand screw
rule (Fig. XI.3). For different directions of $\mathbf{n}$, the quantity (XI.15)
will have different values at the same point $P$, and it is not
difficult to see that values of the quantity (XI.15) differing only in
their sign will correspond to opposite directions of $\mathbf{n}$. Consequently,
the quantity defined by formula (XI.15) behaves like the projection of
a vector onto the direction of a normal to the contour $\Gamma$. This vector is
known as the curl of the vector field at the point $P$ and is designated
by the symbol curl $\mathbf{a}$. Hence,

$$(\operatorname{curl}\mathbf{a})_n = \lim_{f \to 0} \frac{C_a}{f} = \lim_{f \to 0} \frac{1}{f} \oint_\Gamma \mathbf{a}\, d\mathbf{l} \tag{XI.16}$$

Fig. XI.3.

Formula (XI.16) gives a most general definition of the curl that
does not depend on the choice of the coordinate system. Let us find
an expression for the curl in terms of the projections of the vector $\mathbf{a}$
onto the axes of a Cartesian coordinate system.

We shall begin with determination of the projection of the vector
curl $\mathbf{a}$ onto the $x$-axis. For this purpose, we take in the vicinity of
the point $P$ the contour $\Gamma$ in a plane perpendicular to the $x$-axis
(Fig. XI.4). We choose the direction of circumvention of the contour so
that it forms a right-handed system with the direction of the $x$-axis.
Hence, the directions of $\mathbf{n}$ and the $x$-axis will coincide, and expression
(XI.16) will give $(\operatorname{curl}\mathbf{a})_x$. For the contour we have chosen, we have

$$\oint_\Gamma a_l\, dl = \oint_\Gamma \mathbf{a}\, d\mathbf{l} = \oint_\Gamma (a_y\, dy + a_z\, dz)$$

(for all the elements of the contour $dx = 0$).

Fig. XI.4.


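The values of $a_y$ for points of the contour can be written as¹

$$a_y = a_{yP} + \left(\frac{\partial a_y}{\partial y}\right)_P (y - y_P) + \left(\frac{\partial a_y}{\partial z}\right)_P (z - z_P) + \varepsilon_y = \text{const} + \left(\frac{\partial a_y}{\partial y}\right)_P y + \left(\frac{\partial a_y}{\partial z}\right)_P z + \varepsilon_y$$

where the constant term includes three addends not depending on
$y$ and $z$, and $\varepsilon_y$ is a quantity of a higher order of smallness than the
linear dimensions of the contour. Consequently,

$$\oint a_y\, dy = \text{const} \oint dy + \left(\frac{\partial a_y}{\partial y}\right)_P \oint y\, dy + \left(\frac{\partial a_y}{\partial z}\right)_P \oint z\, dy + \oint \varepsilon_y\, dy$$

It is not difficult to see that $\oint dy = 0$. In exactly the same way,
the integral $\oint y\, dy = \frac{1}{2} \oint d(y^2)$ is zero. It is simple to see from
Fig. XI.4 that $\oint z\, dy = -f$, where $f$ is the area of the contour.
Hence,

$$\oint a_y\, dy = -\left(\frac{\partial a_y}{\partial z}\right)_P f + \varepsilon'_y \tag{XI.17}$$

where $\varepsilon'_y$ is a quantity of a higher order of smallness than the area $f$
of the contour.

Similar calculations for $a_z$ yield the expression

$$\oint a_z\, dz = \text{const} \oint dz + \left(\frac{\partial a_z}{\partial y}\right)_P \oint y\, dz + \left(\frac{\partial a_z}{\partial z}\right)_P \oint z\, dz + \oint \varepsilon_z\, dz$$

The integrals $\oint dz$ and $\oint z\, dz = \frac{1}{2} \oint d(z^2)$ are zero, and $\oint y\, dz = f$.
Therefore,

$$\oint a_z\, dz = \left(\frac{\partial a_z}{\partial y}\right)_P f + \varepsilon'_z \tag{XI.18}$$

The sum of expressions (XI.17) and (XI.18) yields $\oint \mathbf{a}\, d\mathbf{l}$. Dividing
this sum by $f$ in accordance with the definition (XI.16) and
performing a limiting process², we obtain

$$(\operatorname{curl}\mathbf{a})_x = \frac{\partial a_z}{\partial y} - \frac{\partial a_y}{\partial z}$$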

Having considered the circulation for contours oriented with the
normal $\mathbf{n}$ along the $y$- and $z$-axes, we can obtain expressions for the
projections of the curl onto these axes:

$$(\operatorname{curl}\mathbf{a})_y = \frac{\partial a_x}{\partial z} - \frac{\partial a_z}{\partial x}, \qquad (\operatorname{curl}\mathbf{a})_z = \frac{\partial a_y}{\partial x} - \frac{\partial a_x}{\partial y}$$

¹ The meaning of the quantities in this expression is similar to the meaning
of the ones in formula (XI.9): the term containing $(x - x_P)$ is absent because
$x = x_P$ for all the points of the contour.

² The quantities $\varepsilon'/f$ vanish in a limiting process.
It is easy to remember the formulas for the projections of a curl
onto the coordinate axes by giving attention to the fact that in
each of them the subscript on curl $\mathbf{a}$ and the letters on the right in
the denominators form a cyclic transposition following the scheme

$$x \to y \to z \to x$$

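Knowing the projections, it is simple to find the vector itself:

$$\operatorname{curl}\mathbf{a} = \mathbf{e}_x \left(\frac{\partial a_z}{\partial y} - \frac{\partial a_y}{\partial z}\right) + \mathbf{e}_y \left(\frac{\partial a_x}{\partial z} - \frac{\partial a_z}{\partial x}\right) + \mathbf{e}_z \left(\frac{\partial a_y}{\partial x} - \frac{\partial a_x}{\partial y}\right) \tag{XI.19}$$

With a view to the fact that, for example, $\partial a_z/\partial y$ can be written as
$\nabla_y a_z$, etc. ($\nabla_y = \partial/\partial y$ is the projection of the vector $\nabla$ onto the
$y$-axis), formula (XI.19) becomes

$$\operatorname{curl}\mathbf{a} = \begin{vmatrix} \mathbf{e}_x & \mathbf{e}_y & \mathbf{e}_z \\ \dfrac{\partial}{\partial x} & \dfrac{\partial}{\partial y} & \dfrac{\partial}{\partial z} \\ a_x & a_y & a_z \end{vmatrix} \tag{XI.20}$$

Finally, a comparison with formula (VI.31) allows us to write that

$$\operatorname{curl}\mathbf{a} = [\nabla\mathbf{a}] \tag{XI.21}$$

Taking advantage of formulas (VI.29) and (VI.33) for a vector
product, we can write the following expressions for a curl and its
$k$-th component:

$$[\nabla\mathbf{a}] = \sum_{i,k,l} \varepsilon_{ikl}\, \mathbf{e}_i \frac{\partial a_l}{\partial x_k}, \qquad [\nabla\mathbf{a}]_k = \sum_{m,n} \varepsilon_{kmn} \frac{\partial a_n}{\partial x_m} \tag{XI.22}$$

The quantity $[\nabla\mathbf{a}]$ exists at every point of the vector field of $\mathbf{a}$.
Consequently, the curl forms a vector field determined in the same
part of space in which the field of the vector $\mathbf{a}$ is set.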
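The definition (XI.16) can be compared with the Cartesian formula for $(\operatorname{curl}\mathbf{a})_x$ on a concrete case. The sketch below is an illustration under assumptions: the vector field, the point `P`, and the step count `n` are hypothetical, and the contour is a small square in a plane perpendicular to the $x$-axis, traversed so as to form a right-handed system with that axis.

```python
def field(x, y, z):
    """Hypothetical vector field; analytically (curl a)_x = 3*y**2 - 2*z."""
    return (0.0, z * z, y ** 3)

def circulation_per_area(a, P, h, n=50):
    """Circulation of a around a square of side h centred at P in the plane x = const,
    traversed in the right-handed sense about the x-axis, divided by the area h*h."""
    x0, y0, z0 = P
    corners = [(y0 - h / 2, z0 - h / 2), (y0 + h / 2, z0 - h / 2),
               (y0 + h / 2, z0 + h / 2), (y0 - h / 2, z0 + h / 2)]
    total = 0.0
    for s in range(4):
        (ya, za), (yb, zb) = corners[s], corners[(s + 1) % 4]
        dy, dz = (yb - ya) / n, (zb - za) / n    # one step along this side
        for j in range(n):
            y = ya + (j + 0.5) * dy              # midpoint of the j-th step
            z = za + (j + 0.5) * dz
            _, a_y, a_z = a(x0, y, z)
            total += a_y * dy + a_z * dz         # a dl with dx = 0
    return total / (h * h)

P = (0.0, 1.0, 2.0)
exact = 3 * P[1] ** 2 - 2 * P[2]   # d(a_z)/dy - d(a_y)/dz at the point P
for h in (0.1, 0.001):
    print(h, circulation_per_area(field, P, h), exact)
```

For this field the ratio differs from the analytic value by $h^2/4$, so it tends to $(\operatorname{curl}\mathbf{a})_x$ when the contour shrinks to the point.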

Let us take an arbitrary surface $f$ enclosed by the contour $\Gamma$ (the
latter does not necessarily have to be plane and can have any shape)
in the field of the vector $\mathbf{a}$. We divide this surface into small elements
$\Delta f$ (Fig. XI.5). By (XI.16) for the circulation $\Delta C_a$ around the
boundary of an element $\Delta f$, we can write the expression $\Delta C_a \approx [\nabla\mathbf{a}]_n\, \Delta f$,
where $[\nabla\mathbf{a}]_n$ is the projection of $[\nabla\mathbf{a}]$ onto a normal to the given
element $\Delta f$ related to the direction of circumvention by the
right-hand screw rule.

Let us summate these expressions for all the $\Delta f$'s. In the summation
of $\Delta C_a$, the integrals $\int \mathbf{a}\, d\mathbf{l}$ taken along the boundaries between
adjacent areas cancel each other (for adjacent areas these integrals
differ in their sign because they are evaluated in opposite directions).
Only the integrals $\int \mathbf{a}\, d\mathbf{l}$ for the sections coinciding with the contour $\Gamma$
enclosing $f$ remain uncompensated. The sum of these integrals gives
the circulation of $\mathbf{a}$ around the contour $\Gamma$.

The sum on the right in the limit (when $\Delta f \to 0$) transforms into
an integral over the surface. The approximate equality in the limit
transforms into a strict equality. The result is

$$\oint_\Gamma \mathbf{a}\, d\mathbf{l} = \int_f [\nabla\mathbf{a}]\, d\mathbf{f} \tag{XI.23}$$

The relation we have found is called Stokes's theorem. It states that
the circulation of the vector $\mathbf{a}$ around the closed contour $\Gamma$ equals the
flux of the vector $[\nabla\mathbf{a}]$ through the surface enclosed by the contour $\Gamma$.

Fig. XI.5.

The surface over which the integral on the right-hand side of
formula (XI.23) is evaluated may be any one; it is only important
that it be bounded by the contour $\Gamma$. The direction of the normal $\mathbf{n}$ must
be made to agree with the direction of circumventing the contour
$\Gamma$ in integration.

Application of the Operator $\nabla$ to the Product of Functions. When
formulas including $\nabla$ are being compiled, it is necessary to adhere
to the rules of both vector algebra and differential calculus. Assume,
for instance, that $\varphi$ and $\psi$ are scalar position functions. Then

$$\nabla(\varphi\psi) = \nabla_\varphi(\varphi\psi) + \nabla_\psi(\varphi\psi) \tag{XI.24}$$

(the subscript on $\nabla$ indicates which of the functions it is acting on).
The factor on which $\nabla$ does not act in the given term can be put
outside the symbol $\nabla$ (the operator $\nabla$ acts only on quantities
following it). Formula (XI.24) now becomes $\nabla(\varphi\psi) = \psi\nabla_\varphi\varphi + \varphi\nabla_\psi\psi$.
In this expression, the subscripts on $\nabla$ are superfluous, so that we
finally have

$$\nabla(\varphi\psi) = \varphi\nabla\psi + \psi\nabla\varphi \tag{XI.25}$$

(the right-hand side is read "phi gradient of psi plus psi gradient
of phi").

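Let us apply $\nabla$ to the product $\varphi\mathbf{a}$. Here there are two possibilities:
a scalar or a vector product of the vectors $\nabla$ and $\varphi\mathbf{a}$. The
corresponding results are:

$$\nabla(\varphi\mathbf{a}) = \nabla_\varphi(\varphi\mathbf{a}) + \nabla_a(\varphi\mathbf{a}) = \mathbf{a}\nabla\varphi + \varphi\nabla\mathbf{a} \tag{XI.26}$$

("a gradient of phi plus phi divergence of a") and

$$[\nabla, (\varphi\mathbf{a})] = [\nabla_\varphi, (\varphi\mathbf{a})] + [\nabla_a, (\varphi\mathbf{a})] = [(\nabla\varphi), \mathbf{a}] + \varphi[\nabla\mathbf{a}] \tag{XI.27}$$

Now let us apply $\nabla$ to the product $[\mathbf{a}\mathbf{b}]$, first obtaining a scalar
product of the vectors: $\nabla[\mathbf{a}\mathbf{b}] = \nabla_a[\mathbf{a}\mathbf{b}] + \nabla_b[\mathbf{a}\mathbf{b}]$. We perform the
cyclic transposition (VI.3) in each of the terms:

$$\nabla[\mathbf{a}\mathbf{b}] = \mathbf{b}[\nabla_a\mathbf{a}] + \mathbf{a}[\mathbf{b}\nabla_b] = \mathbf{b}[\nabla_a\mathbf{a}] - \mathbf{a}[\nabla_b\mathbf{b}]$$

(we have exchanged the places of $\mathbf{b}$ and $\nabla_b$ in the second term to have
the vector $\mathbf{b}$ following the operator $\nabla_b$ that acts on it; the sign in
the vector product has changed accordingly). Omitting the
subscripts needed no longer, we arrive at the formula

$$\nabla[\mathbf{a}\mathbf{b}] = \mathbf{b}[\nabla\mathbf{a}] - \mathbf{a}[\nabla\mathbf{b}] \tag{XI.28}$$

("b curl of a minus a curl of b").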

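The vector product of $\nabla$ by $[\mathbf{a}\mathbf{b}]$ yields
$[\nabla, [\mathbf{a}\mathbf{b}]] = [\nabla_a, [\mathbf{a}\mathbf{b}]] + [\nabla_b, [\mathbf{a}\mathbf{b}]]$.
Let us expand each of the terms using formula (VI.5):
$[\nabla, [\mathbf{a}\mathbf{b}]] = \mathbf{a}(\nabla_a\mathbf{b}) - \mathbf{b}(\nabla_a\mathbf{a}) + \mathbf{a}(\nabla_b\mathbf{b}) - \mathbf{b}(\nabla_b\mathbf{a})$. Arranging the
factors so that we may suppress the subscripts on $\nabla$, we obtain

$$[\nabla, [\mathbf{a}\mathbf{b}]] = (\mathbf{b}\nabla)\mathbf{a} - (\mathbf{a}\nabla)\mathbf{b} + \mathbf{a}(\nabla\mathbf{b}) - \mathbf{b}(\nabla\mathbf{a}) \tag{XI.29}$$

The expressions $(\mathbf{a}\nabla)$ and $(\mathbf{b}\nabla)$ are scalar differential operators. For
example,

$$(\mathbf{a}\nabla) = \sum_i a_i \frac{\partial}{\partial x_i} \tag{XI.30}$$

These operators can be applied to both scalar and vector functions.
When applied to the scalar $\varphi$, the operator (XI.30) gives

$$(\mathbf{a}\nabla)\varphi = \sum_i a_i \frac{\partial\varphi}{\partial x_i} = \mathbf{a}(\nabla\varphi) \tag{XI.31}$$

The action of the operator $(\mathbf{a}\nabla)$ on the vector $\mathbf{b}$ yields the expression

$$(\mathbf{a}\nabla)\mathbf{b} = \sum_i a_i \frac{\partial\mathbf{b}}{\partial x_i} = \sum_i a_i \frac{\partial}{\partial x_i} \left(\sum_k \mathbf{e}_k b_k\right) = \sum_k \mathbf{e}_k \sum_i a_i \frac{\partial b_k}{\partial x_i} \tag{XI.32}$$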

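Let us apply the operator (XI.30) to the product of the scalar
function $\varphi$ and the vector function $\mathbf{b}$:

$$(\mathbf{a}\nabla)(\varphi\mathbf{b}) = (\mathbf{a}\nabla_\varphi)(\varphi\mathbf{b}) + (\mathbf{a}\nabla_b)(\varphi\mathbf{b}) = \mathbf{b}(\mathbf{a}\nabla)\varphi + \varphi(\mathbf{a}\nabla)\mathbf{b} = \mathbf{b}(\mathbf{a} \cdot \nabla\varphi) + \varphi(\mathbf{a}\nabla)\mathbf{b} \tag{XI.33}$$

It is useful to know the value of the expression $(\mathbf{a}\nabla)\mathbf{r}$, where $\mathbf{r}$
is a position vector, and $\mathbf{a}$ is an arbitrary vector. Substituting $\mathbf{r}$
for $\mathbf{b}$ in (XI.32) and taking into account that $\partial x_k/\partial x_i = \delta_{ik}$, we
obtain

$$(\mathbf{a}\nabla)\mathbf{r} = \sum_{k,i} \mathbf{e}_k a_i \delta_{ik} = \sum_k \mathbf{e}_k a_k = \mathbf{a} \tag{XI.34}$$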

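We have obtained formulas (XI.25)-(XI.29) quite easily. The
finding of the gradient of the scalar product of the two vectors
$\nabla(\mathbf{a}\mathbf{b})$ is more involved because it is not clear what should be
understood, for example, by the expression $\nabla_a(\mathbf{a}\mathbf{b})$. It cannot be
interpreted as $(\nabla_a\mathbf{a})\mathbf{b}$ because the operations of multiplying $\mathbf{a}$ and $\mathbf{b}$
and of applying $\nabla_a$ cannot be interchanged. This difficulty can be
surmounted by employing auxiliary relations following from formula
(VI.5): $[\mathbf{a}, [\nabla\mathbf{b}]] = \nabla_b(\mathbf{a}\mathbf{b}) - \mathbf{b}(\nabla_b\mathbf{a}) = \nabla_b(\mathbf{a}\mathbf{b}) - (\mathbf{a}\nabla)\mathbf{b}$, whence

$$\nabla_b(\mathbf{a}\mathbf{b}) = [\mathbf{a}, [\nabla\mathbf{b}]] + (\mathbf{a}\nabla)\mathbf{b} \tag{XI.35}$$

Having written $[\mathbf{b}, [\nabla\mathbf{a}]]$ in the same way, we arrive at the relation

$$\nabla_a(\mathbf{a}\mathbf{b}) = [\mathbf{b}, [\nabla\mathbf{a}]] + (\mathbf{b}\nabla)\mathbf{a} \tag{XI.36}$$

Substitution of relations (XI.35) and (XI.36) into the formula

$$\nabla(\mathbf{a}\mathbf{b}) = \nabla_a(\mathbf{a}\mathbf{b}) + \nabla_b(\mathbf{a}\mathbf{b})$$

yields the following expression for the gradient of the scalar product
of the vectors $\mathbf{a}$ and $\mathbf{b}$:

$$\nabla(\mathbf{a}\mathbf{b}) = [\mathbf{a}, [\nabla\mathbf{b}]] + [\mathbf{b}, [\nabla\mathbf{a}]] + (\mathbf{a}\nabla)\mathbf{b} + (\mathbf{b}\nabla)\mathbf{a} \tag{XI.37}$$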
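Formulas of this kind are easy to get wrong in sign, so a numerical spot check is worthwhile. The sketch below is an illustration only: the vector fields `a` and `b`, the point `P`, and the step `H` are hypothetical choices, and all derivatives are estimated by central differences. It compares the two sides of (XI.28) and of (XI.37) at one point.

```python
H = 1e-4   # step of the central differences (an assumed value)

def partial(f, P, axis):
    """Central-difference derivative of a vector-valued function f along one axis."""
    plus, minus = list(P), list(P)
    plus[axis] += H
    minus[axis] -= H
    return [(p - m) / (2 * H) for p, m in zip(f(*plus), f(*minus))]

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]

def div(f, P):
    return sum(partial(f, P, i)[i] for i in range(3))

def curl(f, P):
    d = [partial(f, P, i) for i in range(3)]      # d[i][k] = d f_k / d x_i
    return [d[1][2] - d[2][1], d[2][0] - d[0][2], d[0][1] - d[1][0]]

def grad(s, P):
    """Gradient of a scalar function s."""
    return [partial(lambda *r: [s(*r)], P, i)[0] for i in range(3)]

def directional(u, f, P):
    """(u nabla) f = sum_i u_i * d f / d x_i, with u taken at the point P."""
    d = [partial(f, P, i) for i in range(3)]
    return [sum(u[i] * d[i][k] for i in range(3)) for k in range(3)]

def a(x, y, z):
    return [y * z, x * x, x * y]

def b(x, y, z):
    return [z, x * y, y * y]

P = [0.5, -1.0, 2.0]
aP, bP = a(*P), b(*P)

# (XI.28): nabla [ab] = b [nabla a] - a [nabla b]
lhs28 = div(lambda *r: cross(a(*r), b(*r)), P)
rhs28 = dot(bP, curl(a, P)) - dot(aP, curl(b, P))

# (XI.37): nabla (ab) = [a, [nabla b]] + [b, [nabla a]] + (a nabla) b + (b nabla) a
lhs37 = grad(lambda *r: dot(a(*r), b(*r)), P)
terms = [cross(aP, curl(b, P)), cross(bP, curl(a, P)),
         directional(aP, b, P), directional(bP, a, P)]
rhs37 = [sum(t[k] for t in terms) for k in range(3)]

print(lhs28, rhs28)
print(lhs37, rhs37)
```

Agreement at one point does not prove an identity, but a mismatch would immediately reveal a misplaced sign or subscript.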

Repeated Application of the Operator V« The action of the del 
operator on scalar or vector functions results in new vector or scalar 
functions which the operator V can be applied to, in turn. 

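The gradient of the function $\varphi$ is a vector. Consequently, both the
divergence and the curl operations can be applied to it. Let us
calculate the divergence of a gradient. By formulas (XI.2) and
(XI.11), we have

$$\nabla(\nabla\varphi) = \nabla^2\varphi = \sum_i \frac{\partial}{\partial x_i}(\nabla\varphi)_i = \sum_i \frac{\partial}{\partial x_i}\frac{\partial\varphi}{\partial x_i} = \sum_i \frac{\partial^2\varphi}{\partial x_i^2} = \Delta\varphi$$

Hence

$$\nabla(\nabla\varphi) = \Delta\varphi \tag{XI.38}$$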

where $\Delta$ is the Laplacian operator:

$$\Delta = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} \tag{XI.39}$$

Our calculations show that

$$\nabla^2 = \Delta \tag{XI.40}$$

It must be borne in mind, however, that such a relation between the
operators $\nabla$ and $\Delta$ occurs only in Cartesian coordinates. In other
coordinate systems, for instance in a cylindrical or a spherical one,
relation (XI.40) is not observed. The most general definition of the
operator $\Delta$ that holds in any coordinate system is that following
from relation (XI.38), which can be written as

$$\Delta\varphi = \operatorname{div}\operatorname{grad}\varphi \tag{XI.41}$$

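Let us find curl grad $\varphi$. By (XI.22), we have

$$[\nabla, \nabla\varphi] = \sum_{i,k,l} \varepsilon_{ikl}\, \frac{\partial}{\partial x_i}\Big(\frac{\partial\varphi}{\partial x_k}\Big)\, \mathbf{e}_l \tag{XI.42}$$

Since $\partial^2\varphi/\partial x_i\,\partial x_k = \partial^2\varphi/\partial x_k\,\partial x_i$, expression (XI.42) is zero, so that

$$\operatorname{curl}\operatorname{grad}\varphi = 0 \tag{XI.43}$$

This is what should be expected because $[\nabla, \nabla\varphi] = [\nabla\nabla]\,\varphi$, and the
vector product of a vector by itself is zero.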

Let us calculate the divergence of a curl. By formulas (XI.11) and
(XI.22), we obtain

$$\nabla[\nabla\mathbf{a}] = \sum_k \frac{\partial}{\partial x_k}\Big(\sum_{m,n} \varepsilon_{kmn} \frac{\partial a_n}{\partial x_m}\Big) = \sum_{k,m,n} \varepsilon_{kmn} \frac{\partial^2 a_n}{\partial x_k\,\partial x_m}$$

Seeing that $\partial^2 a_n/\partial x_k\,\partial x_m = \partial^2 a_n/\partial x_m\,\partial x_k$, the last expression equals
zero. Consequently,

$$\operatorname{div}\operatorname{curl}\mathbf{a} = 0 \tag{XI.44}$$

We could have arrived at this result directly by taking into account
that the scalar triple product of vectors (which is what $\nabla[\nabla\mathbf{a}]$ is)
equals the volume of a parallelepiped constructed on the vectors
being multiplied. Therefore, when two of the three factors coincide,
such a product is zero.

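To calculate curl curl $\mathbf{a}$, we shall proceed from formula (XI.22):

$$[\nabla, [\nabla\mathbf{a}]] = \sum_{i,k,l} \varepsilon_{ikl}\, \mathbf{e}_l\, \frac{\partial}{\partial x_i} \sum_{m,n} \varepsilon_{mnk} \frac{\partial a_n}{\partial x_m}$$

Let us perform a cyclic transposition of the subscripts on $\varepsilon$ so that
the subscript $k$ is the last on both $\varepsilon$'s, and let us use relation (VI.16).
The result is

$$[\nabla, [\nabla\mathbf{a}]] = \sum_{i,l,m,n} \mathbf{e}_l \frac{\partial^2 a_n}{\partial x_i\,\partial x_m} \sum_k \varepsilon_{lik}\,\varepsilon_{mnk} = \sum_{i,l,m,n} \mathbf{e}_l \frac{\partial^2 a_n}{\partial x_i\,\partial x_m}\,(\delta_{lm}\delta_{in} - \delta_{ln}\delta_{im})$$

$$= \sum_{i,l,m,n} \mathbf{e}_l \frac{\partial^2 a_n}{\partial x_i\,\partial x_m}\,\delta_{lm}\delta_{in} - \sum_{i,l,m,n} \mathbf{e}_l \frac{\partial^2 a_n}{\partial x_i\,\partial x_m}\,\delta_{ln}\delta_{im}$$

We sum over the subscripts $m$ and $n$. This causes the expression
we have obtained to become

$$\sum_{i,l} \mathbf{e}_l \frac{\partial^2 a_i}{\partial x_i\,\partial x_l} - \sum_{i,l} \mathbf{e}_l \frac{\partial^2 a_l}{\partial x_i^2}$$

which can be written as follows:

$$\sum_l \mathbf{e}_l \frac{\partial}{\partial x_l}\Big(\sum_i \frac{\partial a_i}{\partial x_i}\Big) - \sum_i \frac{\partial^2}{\partial x_i^2}\Big(\sum_l a_l \mathbf{e}_l\Big) = \nabla(\nabla\mathbf{a}) - \Delta\mathbf{a}$$

We have thus arrived at the formula

$$[\nabla, [\nabla\mathbf{a}]] = \nabla(\nabla\mathbf{a}) - \Delta\mathbf{a} \tag{XI.45}$$

or

$$\operatorname{curl}\operatorname{curl}\mathbf{a} = \operatorname{grad}\operatorname{div}\mathbf{a} - \Delta\mathbf{a} \tag{XI.46}$$

It is a simple matter to see that relation (XI.45) can be obtained
if we expand $[\nabla, [\nabla\mathbf{a}]]$ by formula (VI.5), treating $\nabla$ as a conventional
vector.

Examination of (XI.46) shows that

$$\operatorname{grad}\operatorname{div}\mathbf{a} = \operatorname{curl}\operatorname{curl}\mathbf{a} + \Delta\mathbf{a} \tag{XI.47}$$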
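
Relation (XI.46) can be spot-checked numerically. The sketch below is illustrative only: the test field `a`, the evaluation point and the step `H` are assumptions, second derivatives are obtained by nesting central differences, and the tuple `exact` is the value of curl curl a worked out by hand for this particular field.

```python
import math

H = 1e-3  # step for nested central differences; an illustrative choice

def a(x):
    # Arbitrary smooth test field (an assumption, not from the text).
    return [x[0] ** 2 * x[1], math.sin(x[1]) * x[2], x[0] * x[2] ** 2]

def d(f, i):
    """Return the function x -> central-difference estimate of d f / d x_i.
    f maps a 3-list to a list of numbers."""
    def df(x):
        xp, xm = list(x), list(x)
        xp[i] += H
        xm[i] -= H
        return [(p - m) / (2 * H) for p, m in zip(f(xp), f(xm))]
    return df

def curl(f):
    def c(x):
        J = [d(f, i)(x) for i in range(3)]  # J[i][k] = d f_k / d x_i
        return [J[1][2] - J[2][1], J[2][0] - J[0][2], J[0][1] - J[1][0]]
    return c

def div(f):
    # Returned as a one-element list so that d() can be applied to it again.
    return lambda x: [sum(d(f, i)(x)[i] for i in range(3))]

def grad(phi):
    return lambda x: [d(phi, i)(x)[0] for i in range(3)]

def laplacian(f):
    def lap(x):
        second = [d(d(f, i), i)(x) for i in range(3)]
        return [sum(s[k] for s in second) for k in range(3)]
    return lap

point = [0.4, 0.9, -0.6]
lhs = curl(curl(a))(point)
rhs = [g - l for g, l in zip(grad(div(a))(point), laplacian(a)(point))]
# Hand-computed curl curl a for the field above: (2 x_3, 2 x_1, cos x_2).
exact = [2 * point[2], 2 * point[0], math.cos(point[1])]
print(lhs, rhs, exact)
```

All three printed vectors should agree to within the finite-difference error.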

The divergence is a scalar. Therefore, no operation except that of 
finding the gradient can be applied to it. 

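Some Formulas of Vector Analysis. Let us find the divergence and
curl of the position vector $\mathbf{r}$, and also the gradient of the magnitude
of $\mathbf{r}$. Taking into account that $r = (\sum_i x_i^2)^{1/2}$, we obtain by (XI.2)

$$\nabla r = \sum_i \mathbf{e}_i \frac{\partial r}{\partial x_i} = \sum_i \mathbf{e}_i \frac{x_i}{r} = \frac{\mathbf{r}}{r} = \mathbf{e}_r \tag{XI.48}$$

where $\mathbf{e}_r$ is the unit vector of the position vector $\mathbf{r}$.
By (XI.11), we have

$$\nabla\mathbf{r} = \sum_i \frac{\partial r_i}{\partial x_i} = \sum_i \frac{\partial x_i}{\partial x_i} = 3 \tag{XI.49}$$

(we have taken advantage of the fact that $r_i = x_i$). In an $n$-dimensional
space, $\nabla\mathbf{r} = n$.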

It is not difficult to see that the curl of the position vector is zero:

$$[\nabla\mathbf{r}] = 0 \tag{XI.50}$$

To this end, we can use formula (XI.22), substituting $x_k$ for $a_k$
in it. We obtain

$$[\nabla\mathbf{r}] = \sum_{i,k,l} \varepsilon_{ikl} \frac{\partial x_k}{\partial x_i}\, \mathbf{e}_l = \sum_{i,k,l} \varepsilon_{ikl}\, \delta_{ik}\, \mathbf{e}_l$$

This expression is zero because at identical values of the subscripts
$i$ and $k$, the factor $\varepsilon_{ikl}$ vanishes, and at different values of them,
the factor $\delta_{ik}$ vanishes.

Relation (XI.50) also follows from the fact that the circulation
of the vector $\mathbf{r}$ around any contour is zero. Indeed, according to
(XI.14)

$$C_r = \oint \mathbf{r}\,d\mathbf{l} = \oint (x\,dx + y\,dy + z\,dz) = \frac{1}{2}\oint d(x^2 + y^2 + z^2)$$

The integrand is a total differential, therefore the integral over a
closed contour is zero.

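Now let us find the gradient of a function of $r$, i.e. $\nabla\varphi(r)$. The
partial derivative of $\varphi(r)$ with respect to $x_k$ has the form¹
$\partial\varphi/\partial x_k = (\partial\varphi/\partial r)\,(\partial r/\partial x_k) = (\partial\varphi/\partial r)\,(x_k/r)$. Consequently,

$$\nabla\varphi(r) = \sum_k \mathbf{e}_k \frac{\partial\varphi}{\partial r}\frac{x_k}{r} = \frac{\partial\varphi}{\partial r}\,\frac{1}{r}\sum_k x_k \mathbf{e}_k = \frac{\partial\varphi}{\partial r}\,\frac{\mathbf{r}}{r}$$

Taking formula (XI.48) into consideration, we can write

$$\nabla\varphi(r) = \frac{\partial\varphi}{\partial r}\,\nabla r = \frac{\partial\varphi}{\partial r}\,\frac{\mathbf{r}}{r} = \frac{\partial\varphi}{\partial r}\,\mathbf{e}_r \tag{XI.51}$$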


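Assume that we have a function of the distance between two points:
$\varphi(|\mathbf{r} - \mathbf{r}'|) = \varphi\big(\sqrt{\sum_i (x_i - x_i')^2}\big) = \varphi(R)$, where $R = \sqrt{\sum_i (x_i - x_i')^2}$.

The operation of finding the gradient can be applied to this function
in two ways: we can perform differentiation either with respect to
the coordinates $x_i$ or to the coordinates $x_i'$. To distinguish between
these two gradients, in the first case we shall use the symbol $\nabla$,
and in the second, $\nabla'$. The components of the gradient in both cases
have the form

$$(\nabla\varphi)_i = \frac{\partial\varphi}{\partial R}\frac{\partial R}{\partial x_i}, \qquad (\nabla'\varphi)_i = \frac{\partial\varphi}{\partial R}\frac{\partial R}{\partial x_i'}$$

It is evident that the derivatives $\partial R/\partial x_i$ and $\partial R/\partial x_i'$ differ only in
their sign. We thus conclude that

$$\nabla\varphi = -\nabla'\varphi \tag{XI.52}$$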
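
Formulas (XI.51) and (XI.52) can be illustrated numerically. In the sketch below the function `phi` (a screened 1/R dependence), the two points and the step `H` are hypothetical choices made only for the check; any smooth function of the distance would serve equally well.

```python
import math

H = 1e-6  # finite-difference step; an illustrative choice

def phi(R):
    # Hypothetical test function of the distance R.
    return math.exp(-R) / R

def dphi(R):
    # Its derivative with respect to R, worked out by hand.
    return -math.exp(-R) * (R + 1.0) / R ** 2

def distance(x, xs):
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(x, xs)))

def grad_unprimed(x, xs):
    """Differentiate phi(|x - xs|) with respect to the coordinates x_i."""
    out = []
    for i in range(3):
        xp, xm = list(x), list(x)
        xp[i] += H
        xm[i] -= H
        out.append((phi(distance(xp, xs)) - phi(distance(xm, xs))) / (2 * H))
    return out

def grad_primed(x, xs):
    """Differentiate the same function with respect to the primed coordinates."""
    out = []
    for i in range(3):
        sp, sm = list(xs), list(xs)
        sp[i] += H
        sm[i] -= H
        out.append((phi(distance(x, sp)) - phi(distance(x, sm))) / (2 * H))
    return out

x, xs = [1.0, 2.0, -0.5], [0.2, -0.3, 0.4]
R = distance(x, xs)
g, gp = grad_unprimed(x, xs), grad_primed(x, xs)
# (d phi / d R) times the unit vector along r - r', cf. (XI.51).
radial = [dphi(R) * (p - q) / R for p, q in zip(x, xs)]
print(g, gp, radial)
```

Setting `xs` to the origin reduces the check to (XI.51) for a function of r alone.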

Let us calculate the curl of the unit vector $\mathbf{e}_r = \mathbf{r}/r$. Representing
$\mathbf{r}/r$ as the product of $\varphi = 1/r$ and $\mathbf{a} = \mathbf{r}$, we shall apply formula
(XI.27). As a result, we find that $[\nabla, \mathbf{r}/r] = [\nabla(1/r), \mathbf{r}] + (1/r)\,[\nabla\mathbf{r}]$.
By (XI.51), we have $\nabla(1/r) = -(1/r^2)\,\mathbf{r}/r$. This vector is collinear with $\mathbf{r}$,
therefore the first term is zero. The second term is zero in accordance
with (XI.50). Hence,

$$[\nabla, \mathbf{r}/r] = [\nabla, \mathbf{e}_r] = 0 \tag{XI.53}$$

¹ We have written $\partial\varphi/\partial r$ instead of $d\varphi/dr$ because it may happen that $\varphi$,
in addition to $r$, also depends, say, on $t$.

Now let us find the curl of the function $\varphi(r)\,\mathbf{e}_r$, where $\varphi(r)$ is an
arbitrary function of $r$. We again apply formulas (XI.27) and (XI.51):

$$[\nabla, \varphi(r)\,\mathbf{e}_r] = [\nabla\varphi(r), \mathbf{e}_r] + \varphi(r)\,[\nabla, \mathbf{e}_r] = [(\partial\varphi/\partial r)\,\mathbf{e}_r, \mathbf{e}_r] + \varphi(r)\,[\nabla, \mathbf{e}_r] = 0$$

[see (XI.53)]. Hence

$$[\nabla, \varphi(r)\,\mathbf{e}_r] = 0 \tag{XI.54}$$

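Assume that we are given the vector function $\mathbf{a}(\xi) = \sum_k a_k(\xi)\,\mathbf{e}_k$
of the scalar quantity $\xi$ that, in turn, is a function of the coordinates
$x_i$, that is, $\xi = \xi(x_i)$. Let us find the curl and divergence of this
function. According to the rules for the differentiation of a composite
function,

$$\frac{\partial a_k}{\partial x_i} = \frac{da_k}{d\xi}\frac{\partial\xi}{\partial x_i} \tag{XI.55}$$

By (XI.22)

$$[\nabla\mathbf{a}] = \sum_{i,k,l} \varepsilon_{ikl} \frac{\partial a_k}{\partial x_i}\,\mathbf{e}_l = \sum_{i,k,l} \varepsilon_{ikl} \frac{da_k}{d\xi}\frac{\partial\xi}{\partial x_i}\,\mathbf{e}_l$$

The quantity $da_k/d\xi$ is the $k$-th component of the vector $d\mathbf{a}/d\xi$, and
$\partial\xi/\partial x_i$ is the $i$-th component of the gradient of the function $\xi$.
Therefore,

$$[\nabla\mathbf{a}] = \sum_{i,k,l} \varepsilon_{ikl}\,(\nabla\xi)_i \Big(\frac{d\mathbf{a}}{d\xi}\Big)_k \mathbf{e}_l = \Big[\nabla\xi, \frac{d\mathbf{a}}{d\xi}\Big]$$

[we have used formula (VI.29)]. Hence, if $\mathbf{a} = \mathbf{a}(\xi)$, where
$\xi = \xi(x_1, x_2, x_3)$, we have

$$[\nabla\mathbf{a}] = \Big[\nabla\xi, \frac{d\mathbf{a}}{d\xi}\Big] \tag{XI.56}$$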


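To find the divergence of the function $\mathbf{a}(\xi)$, we proceed from
formula (XI.11). With a view to (XI.55), we obtain

$$\nabla\mathbf{a} = \sum_k \frac{\partial a_k}{\partial x_k} = \sum_k \frac{da_k}{d\xi}\frac{\partial\xi}{\partial x_k} = \sum_k \Big(\frac{d\mathbf{a}}{d\xi}\Big)_k (\nabla\xi)_k = \frac{d\mathbf{a}}{d\xi}\cdot\nabla\xi \tag{XI.57}$$

Assuming in formulas (XI.56) and (XI.57) that $\xi = r$ (where $r$
is the magnitude of the position vector), we find that

$$[\nabla, \mathbf{a}(r)] = \Big[\nabla r, \frac{d\mathbf{a}}{dr}\Big] = \Big[\mathbf{e}_r, \frac{d\mathbf{a}}{dr}\Big] \tag{XI.58}$$

$$\nabla\mathbf{a}(r) = \frac{d\mathbf{a}}{dr}\,\nabla r = \frac{d\mathbf{a}}{dr}\,\mathbf{e}_r \tag{XI.59}$$

[see (XI.48)].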

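We shall prove the formula

$$\int_V [\nabla\mathbf{a}]\,dV = \oint_f [\mathbf{n}\mathbf{a}]\,df = \oint_f [d\mathbf{f}, \mathbf{a}] \tag{XI.60}$$

where $\mathbf{a}$ is a vector function, $V$ is an arbitrary volume, $f$ is the surface
confining it, and $\mathbf{n}$ is an outward normal to the surface element $df$.
For this purpose, we find the scalar product of the vector $[\nabla\mathbf{a}]$ and
an arbitrary constant vector $\mathbf{b}$. Inspection of the formula
$\nabla[\mathbf{a}\mathbf{b}] = \mathbf{b}\,[\nabla\mathbf{a}] - \mathbf{a}\,[\nabla\mathbf{b}]$ [see (XI.28)] reveals that
$\mathbf{b}\,[\nabla\mathbf{a}] = \nabla[\mathbf{a}\mathbf{b}] + \mathbf{a}\,[\nabla\mathbf{b}] = \nabla[\mathbf{a}\mathbf{b}]$ ($[\nabla\mathbf{b}] = 0$ because $\mathbf{b} = \text{const}$).
Now let us integrate the relation obtained over a certain volume $V$ and
apply the Ostrogradsky-Gauss theorem to the right-hand side:

$$\int_V \mathbf{b}\,[\nabla\mathbf{a}]\,dV = \int_V \nabla[\mathbf{a}\mathbf{b}]\,dV = \oint_f [\mathbf{a}\mathbf{b}]\,\mathbf{n}\,df$$

Performing a cyclic transposition of the vectors being multiplied
in the last integral [see (VI.3)], we obtain

$$\int_V \mathbf{b}\,[\nabla\mathbf{a}]\,dV = \oint_f \mathbf{b}\,[\mathbf{n}\mathbf{a}]\,df$$

We put the constant vector $\mathbf{b}$ outside the integral:

$$\mathbf{b}\int_V [\nabla\mathbf{a}]\,dV = \mathbf{b}\oint_f [\mathbf{n}\mathbf{a}]\,df$$

This relation must be observed with an arbitrary choice of the vector
$\mathbf{b}$. We can therefore cancel $\mathbf{b}$. As a result, we arrive at formula
(XI.60).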

Integral Determination of the Operator $\nabla$. Let us consider the
integral

$$\oint \varphi\,d\mathbf{f} \tag{XI.61}$$

over the closed surface $f$, where $\varphi$ is an arbitrary scalar function, and
$d\mathbf{f} = \mathbf{n}\,df$ is an elementary vector of the area $df$. Let us take the
surface of an infinitely small rectangular parallelepiped with the
sides $dx$, $dy$, $dz$ as the surface over which the integral is being
evaluated. The integral (XI.61) can therefore be written as the sum of
six integrals each of which is evaluated over one of the faces of the
parallelepiped. Owing to the smallness of the faces, the value of $\varphi$
within the limits of each face may be considered as constant.
Consequently,

$$\oint \varphi\,d\mathbf{f} = \mathbf{e}_x\,[\varphi(x + dx, y, z) - \varphi(x, y, z)]\,dy\,dz + \mathbf{e}_y\,[\varphi(x, y + dy, z) - \varphi(x, y, z)]\,dx\,dz + \mathbf{e}_z\,[\varphi(x, y, z + dz) - \varphi(x, y, z)]\,dx\,dy$$

$$= \Big(\mathbf{e}_x\frac{\partial\varphi}{\partial x} + \mathbf{e}_y\frac{\partial\varphi}{\partial y} + \mathbf{e}_z\frac{\partial\varphi}{\partial z}\Big)\,dx\,dy\,dz = \nabla\varphi\,dV$$

whence

$$\nabla\varphi = \lim_{V\to 0}\frac{1}{V}\oint \varphi\,d\mathbf{f} \tag{XI.62}$$

In accordance with formula (XI.62), the operator $\nabla$ can be determined
as follows:

$$\nabla = \lim_{V\to 0}\frac{1}{V}\oint d\mathbf{f} \tag{XI.63}$$

Let us take a volume $V$ and divide it into elementary volumes $\Delta V$.
By formula (XI.62), the approximate equality $\nabla\varphi\cdot\Delta V \approx \oint \varphi\,d\mathbf{f}$ is
observed for each volume.

Now let us sum this equation over all the elementary volumes.
In summation, the right-hand sides corresponding to the boundaries
between adjacent volumes mutually eliminate one another. Only
the terms corresponding to the outer surface remain uncompensated.
Therefore, in passing to the limit at which all the $\Delta V$'s shrink to
zero, we arrive at the relation

$$\int_V \nabla\varphi\,dV = \oint_f \varphi\,d\mathbf{f} \tag{XI.64}$$

According to (XI.64), the integral of the scalar function $\varphi$ over the
closed surface $f$ can be transformed into an integral over the volume
confined in it by replacing the surface element $d\mathbf{f}$ with the operator
$\nabla\,dV$. Here the components of $d\mathbf{f}$ experience the transformation

$$df_i \to dV\,\frac{\partial}{\partial x_i} \tag{XI.65}$$

Naturally, the reverse transformation is also possible.

Matters are similar with the integral of a vector function. Indeed,
according to the Ostrogradsky-Gauss theorem [see (XI.13)], we
have $\int_V \nabla\mathbf{a}\,dV = \oint_f \mathbf{a}\,d\mathbf{f}$, which also agrees with (XI.65).

In general, the transformation (XI.65) is always allowable regardless
of the specific kind of expression in the integrands. An example
of such a transformation is relation (XI.60).
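
Relation (XI.64) can be illustrated on a unit cube with a crude midpoint-rule integration. This is only a sketch under stated assumptions: the test function `phi`, the grid resolution `N` and the step `H` are arbitrary, and both integrals carry a small discretization error.

```python
N = 20          # cells per edge; an illustrative resolution
STEP = 1.0 / N
H = 1e-5        # finite-difference step for the gradient

def phi(x, y, z):
    # Hypothetical test function.
    return x * x * y + z

def grad_phi(x, y, z):
    """Central-difference estimate of the gradient of phi."""
    return [(phi(x + H, y, z) - phi(x - H, y, z)) / (2 * H),
            (phi(x, y + H, z) - phi(x, y - H, z)) / (2 * H),
            (phi(x, y, z + H) - phi(x, y, z - H)) / (2 * H)]

mid = [(i + 0.5) * STEP for i in range(N)]  # midpoints of the cells

# Volume integral of grad(phi) over the unit cube (midpoint rule).
volume = [0.0, 0.0, 0.0]
for x in mid:
    for y in mid:
        for z in mid:
            g = grad_phi(x, y, z)
            volume = [v + gk * STEP ** 3 for v, gk in zip(volume, g)]

# Surface integral of phi df: opposite faces have opposite outward normals,
# so each pair of faces contributes a difference along one unit vector.
surface = [0.0, 0.0, 0.0]
for u in mid:
    for v in mid:
        surface[0] += (phi(1.0, u, v) - phi(0.0, u, v)) * STEP ** 2
        surface[1] += (phi(u, 1.0, v) - phi(u, 0.0, v)) * STEP ** 2
        surface[2] += (phi(u, v, 1.0) - phi(u, v, 0.0)) * STEP ** 2

print(volume, surface)
```

For this particular function the exact value of both sides is the vector (1/2, 1/3, 1); the two numerical results should agree with each other closely and with the exact value to within the grid error.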

Curvilinear Coordinates. It is sometimes convenient to determine
the position of a point not by means of the Cartesian coordinates
$x_1, x_2, x_3$, but by using three other numbers $q_1, q_2, q_3$ corresponding
better to the nature of the problem being considered. These numbers
are known as the curvilinear coordinates of a point.

Imposing (if the need appears) restrictions on the region in which the
curvilinear coordinates vary, we can achieve a one-to-one
correspondence of the variables $x_i$ and $q_i$. Hence, $q_i = q_i(x_1, x_2, x_3)$, and
$x_i = x_i(q_1, q_2, q_3)$ ($i = 1, 2, 3$).

Surfaces described by the equation $q_i(x_1, x_2, x_3) = \text{const}$ are
said to be coordinate surfaces. The lines of intersection of two
coordinate surfaces are called coordinate lines. Only one coordinate varies
along a coordinate line; the other two remain unchanged.


An example of curvilinear coordinates is the spherical system
$r, \vartheta, \varphi$. In this case, (1) spheres, $r = \text{const}$, (2) half-cones,
$\vartheta = \text{const}$, and (3) half-planes, $\varphi = \text{const}$, are the coordinate
surfaces. The coordinate lines are (1) radii (the lines $r$), (2) meridians
(the lines $\vartheta$), and (3) parallels (the lines $\varphi$).

If the coordinate lines at every point are mutually perpendicular, 
the curvilinear coordinates are said to be orthogonal. We shall limit 
ourselves only to a treatment of orthogonal coordinates. They in- 
clude spherical and cylindrical coordinates. 

Let us introduce for every point $P$ the unit vectors $\mathbf{e}^*_1, \mathbf{e}^*_2, \mathbf{e}^*_3$
directed along tangents to the coordinate lines at the given point
towards an increase in the relevant variables $q_i$. Owing to
orthogonality, the following relations hold for the unit vectors $\mathbf{e}^*_i$:

$$\mathbf{e}^*_i\,\mathbf{e}^*_k = \delta_{ik} \tag{XI.66}$$

We determine the derivative of the position vector $\mathbf{r} = \mathbf{r}(q_1, q_2, q_3)$
with respect to the coordinate $q_i$. The remaining two coordinates do
not change in differentiation. Consequently, when the coordinate $q_i$
is given the increment $\delta q_i$, the tip of the vector $\mathbf{r}$ travels along the
coordinate line $q_i$. Therefore, the vector $\partial\mathbf{r}/\partial q_i$ is directed along
a tangent to the coordinate line $q_i$, i.e. is collinear with $\mathbf{e}^*_i$. Denoting
the magnitude of the vector $\partial\mathbf{r}/\partial q_i$ by the symbol $H_i$, we can write
that

$$\frac{\partial\mathbf{r}}{\partial q_i} = H_i\,\mathbf{e}^*_i \quad (i = 1, 2, 3) \tag{XI.67}$$

Representing $\mathbf{r}$ in the form of $\sum_k x_k\mathbf{e}_k$ (here $\mathbf{e}_k$ are the unit vectors
of the Cartesian coordinate axes), we can write that

$$\frac{\partial\mathbf{r}}{\partial q_i} = \sum_k \frac{\partial x_k}{\partial q_i}\,\mathbf{e}_k \quad (i = 1, 2, 3)$$

Squaring of this relation yields

$$H_i^2 = \sum_k \Big(\frac{\partial x_k}{\partial q_i}\Big)^2 \quad (i = 1, 2, 3) \tag{XI.68}$$


The quantities $H_i$ are known as the Lamé coefficients. Their values
can be found by formula (XI.68) if we know the form of the functions
$x_k = x_k(q_i)$. For example, for a spherical coordinate system,
$x_1 = r\sin\vartheta\cos\varphi$, $x_2 = r\sin\vartheta\sin\varphi$, $x_3 = r\cos\vartheta$. Hence

$$\frac{\partial x_1}{\partial q_1} = \frac{\partial x_1}{\partial r} = \sin\vartheta\cos\varphi, \qquad \frac{\partial x_2}{\partial q_1} = \frac{\partial x_2}{\partial r} = \sin\vartheta\sin\varphi, \qquad \frac{\partial x_3}{\partial q_1} = \frac{\partial x_3}{\partial r} = \cos\vartheta$$

so that by (XI.68)

$$H_1^2 = (\sin\vartheta\cos\varphi)^2 + (\sin\vartheta\sin\varphi)^2 + (\cos\vartheta)^2 = 1$$

Similar calculations for $H_2$ and $H_3$ yield

$$H_1 = 1, \quad H_2 = r, \quad H_3 = r\sin\vartheta \tag{XI.69}$$

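Let us find the square of the distance $ds$ between two points. By
(XI.67), we have

$$d\mathbf{r} = \sum_i \frac{\partial\mathbf{r}}{\partial q_i}\,dq_i = \sum_i H_i\,\mathbf{e}^*_i\,dq_i \tag{XI.70}$$

Consequently,

$$ds^2 = |d\mathbf{r}|^2 = \sum_i H_i^2\,dq_i^2 \tag{XI.71}$$

For a spherical coordinate system, this formula yields

$$ds^2 = dr^2 + r^2\,d\vartheta^2 + r^2\sin^2\vartheta\,d\varphi^2 \tag{XI.72}$$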


Let us draw coordinate surfaces through the tail and the tip of the
vector $d\mathbf{r}$. The result is an infinitely small parallelepiped whose edges,
by (XI.70), are

$$dl_i = H_i\,dq_i \quad (i = 1, 2, 3) \tag{XI.73}$$

The faces of this parallelepiped have the areas

$$df_1 = H_2H_3\,dq_2\,dq_3, \quad df_2 = H_3H_1\,dq_3\,dq_1, \quad df_3 = H_1H_2\,dq_1\,dq_2 \tag{XI.74}$$

The volume of the parallelepiped is

$$dV = H_1H_2H_3\,dq_1\,dq_2\,dq_3 \tag{XI.75}$$

Instead of calculation by formula (XI.68), the Lamé coefficients
can be found with the aid of expressions (XI.73). For instance, for
spherical coordinates (Fig. XI.6), an elementary parallelepiped has
the edges $dl_1 = dr$, $dl_2 = r\,d\vartheta$, and $dl_3 = r\sin\vartheta\,d\varphi$, whence we
directly obtain the values (XI.69) for the Lamé coefficients.

For a cylindrical coordinate system (Fig. XI.7), the edges of an
elementary parallelepiped are $dl_1 = d\rho$, $dl_2 = \rho\,d\varphi$, and $dl_3 = dz$,
whence

$$H_1 = 1, \quad H_2 = \rho, \quad H_3 = 1 \tag{XI.76}$$
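
The values (XI.69) and (XI.76) can be cross-checked directly from formula (XI.68). The following is a minimal sketch: the function names, the sample points and the step `H` are illustrative assumptions, and the derivatives of the Cartesian coordinates with respect to the curvilinear ones are estimated by central differences.

```python
import math

H = 1e-6  # finite-difference step; an illustrative choice

def spherical(q):
    """Cartesian coordinates from (r, theta, phi)."""
    r, theta, phi = q
    return [r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta)]

def cylindrical(q):
    """Cartesian coordinates from (rho, phi, z)."""
    rho, phi, z = q
    return [rho * math.cos(phi), rho * math.sin(phi), z]

def lame(x_of_q, q):
    """H_i = sqrt(sum_k (dx_k/dq_i)^2), formula (XI.68), by central differences."""
    coeffs = []
    for i in range(3):
        qp, qm = list(q), list(q)
        qp[i] += H
        qm[i] -= H
        dx = [(p - m) / (2 * H) for p, m in zip(x_of_q(qp), x_of_q(qm))]
        coeffs.append(math.sqrt(sum(c * c for c in dx)))
    return coeffs

r, theta, phi = 2.0, 0.8, 1.9
h_sph = lame(spherical, [r, theta, phi])       # expected (1, r, r sin(theta))
h_cyl = lame(cylindrical, [1.5, 0.3, -2.0])    # expected (1, rho, 1)
print(h_sph, h_cyl)
```

The same `lame` function can be pointed at any other orthogonal coordinate map to obtain its coefficients numerically.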

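Let us find the expression for the gradient in curvilinear
coordinates. By formula (XI.5), the projection of the gradient of the function
$\psi$ onto the direction of the unit vector $\mathbf{e}^*_i$ is $\partial\psi/\partial l_i$. With a view to
(XI.73), we obtain

$$\frac{\partial\psi}{\partial l_i} = \frac{1}{H_i}\frac{\partial\psi}{\partial q_i}$$

The gradient itself is determined by the formula

$$\nabla\psi = \sum_i \frac{\partial\psi}{\partial l_i}\,\mathbf{e}^*_i = \sum_i \frac{1}{H_i}\frac{\partial\psi}{\partial q_i}\,\mathbf{e}^*_i \tag{XI.77}$$

Consequently, in a spherical coordinate system

$$\nabla\psi = \frac{\partial\psi}{\partial r}\,\mathbf{e}_r + \frac{1}{r}\frac{\partial\psi}{\partial\vartheta}\,\mathbf{e}_\vartheta + \frac{1}{r\sin\vartheta}\frac{\partial\psi}{\partial\varphi}\,\mathbf{e}_\varphi \tag{XI.78}$$

and in a cylindrical system

$$\nabla\psi = \frac{\partial\psi}{\partial\rho}\,\mathbf{e}_\rho + \frac{1}{\rho}\frac{\partial\psi}{\partial\varphi}\,\mathbf{e}_\varphi + \frac{\partial\psi}{\partial z}\,\mathbf{e}_z \tag{XI.79}$$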

To determine the divergence in curvilinear coordinates, we shall
proceed from the definition (XI.7) according to which

$$\nabla\mathbf{a} = \lim_{V\to 0}\frac{1}{V}\oint \mathbf{a}\,d\mathbf{f} = \lim_{V\to 0}\frac{\Phi}{V} \tag{XI.80}$$

Let us take as $V$ the volume of an infinitely small parallelepiped
including the point $P$ for which the divergence is being calculated.

Fig. XI.6.

Reasoning in the same way as when obtaining formula (XI.11), let
us write an expression for the flux through the faces perpendicular
to the unit vector $\mathbf{e}^*_1$:

$$\Phi_1 = a_1''\,\Delta f_1'' - a_1'\,\Delta f_1' = (a_1H_2H_3)''\,\Delta q_2\,\Delta q_3 - (a_1H_2H_3)'\,\Delta q_2\,\Delta q_3$$

[see (XI.74)]. Here $a_1$ is the projection of the vector $\mathbf{a}$ onto the
direction of $\mathbf{e}^*_1$. One prime indicates the values of quantities relating to
one of two opposite faces, and two primes indicate the values relating
to the other face.

Expanding $a_1H_2H_3$ into a series in the vicinity of the point $P$,
we obtain

$$\Phi_1 = \frac{\partial(a_1H_2H_3)}{\partial q_1}\,\Delta q_1\,\Delta q_2\,\Delta q_3 + \varepsilon_1 = \frac{1}{H_1H_2H_3}\frac{\partial(a_1H_2H_3)}{\partial q_1}\,V + \varepsilon_1$$

where $V$ is the volume of the parallelepiped [see (XI.75)], and $\varepsilon_1$
is a quantity of a higher order of smallness than $V$. The fluxes through
two other pairs of faces are evaluated in a similar way:

$$\Phi_2 = \frac{1}{H_1H_2H_3}\frac{\partial(a_2H_3H_1)}{\partial q_2}\,V + \varepsilon_2, \qquad \Phi_3 = \frac{1}{H_1H_2H_3}\frac{\partial(a_3H_1H_2)}{\partial q_3}\,V + \varepsilon_3$$

Substituting $\Phi = \Phi_1 + \Phi_2 + \Phi_3$ into (XI.80) and performing
a limiting process, we arrive at the formula

$$\nabla\mathbf{a} = \frac{1}{H_1H_2H_3}\Big\{\frac{\partial(a_1H_2H_3)}{\partial q_1} + \frac{\partial(a_2H_3H_1)}{\partial q_2} + \frac{\partial(a_3H_1H_2)}{\partial q_3}\Big\} \tag{XI.81}$$

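Taking (XI.69) into account, we obtain an expression for the
divergence in spherical coordinates:

$$\nabla\mathbf{a} = \frac{1}{r^2}\frac{\partial(r^2a_r)}{\partial r} + \frac{1}{r\sin\vartheta}\frac{\partial(\sin\vartheta\,a_\vartheta)}{\partial\vartheta} + \frac{1}{r\sin\vartheta}\frac{\partial a_\varphi}{\partial\varphi} \tag{XI.82}$$

In cylindrical coordinates

$$\nabla\mathbf{a} = \frac{1}{\rho}\frac{\partial(\rho\,a_\rho)}{\partial\rho} + \frac{1}{\rho}\frac{\partial a_\varphi}{\partial\varphi} + \frac{\partial a_z}{\partial z} \tag{XI.83}$$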


By (XI.16), the projection of a curl onto the direction of the unit
vector $\mathbf{e}^*_i$ is determined by the expression

$$[\nabla\mathbf{a}]_i = \lim_{f_i\to 0}\frac{1}{f_i}\oint \mathbf{a}\,d\mathbf{l} = \lim_{f_i\to 0}\frac{C_i}{f_i} \tag{XI.84}$$

where $f_i$ is a surface element perpendicular to $\mathbf{e}^*_i$, and $C_i$ is the circulation
of the vector $\mathbf{a}$ around the contour confining this element.

To evaluate the projection of $[\nabla\mathbf{a}]$ onto the unit vector $\mathbf{e}^*_1$, let us take $f_1$
as an infinitely small curvilinear rectangle with the sides $\Delta l_2 = H_2\,\Delta q_2$
and $\Delta l_3 = H_3\,\Delta q_3$. The circulation of $\mathbf{a}$ around the contour of this
rectangle can be represented in the form of four integrals like $\int \mathbf{a}\,d\mathbf{l}$.
Two of them are evaluated over the opposite sides $\Delta l_2'$ and $\Delta l_2''$, and two
over the opposite sides $\Delta l_3'$ and $\Delta l_3''$ (Fig. XI.8). As a result, with a view
to (XI.73), we obtain

$$C_1 = a_2'\,\Delta l_2' + a_3''\,\Delta l_3'' - a_2''\,\Delta l_2'' - a_3'\,\Delta l_3' = (a_3''\,\Delta l_3'' - a_3'\,\Delta l_3') - (a_2''\,\Delta l_2'' - a_2'\,\Delta l_2')$$

$$= (a_3''H_3'' - a_3'H_3')\,\Delta q_3 - (a_2''H_2'' - a_2'H_2')\,\Delta q_2 \tag{XI.85}$$

Expanding $a_3H_3$ and $a_2H_2$ into a series in the vicinity of the point
$P$ for which the curl is being calculated, we can write that

$$a_3''H_3'' - a_3'H_3' = \frac{\partial(a_3H_3)}{\partial q_2}\,\Delta q_2 + \varepsilon_3, \qquad a_2''H_2'' - a_2'H_2' = \frac{\partial(a_2H_2)}{\partial q_3}\,\Delta q_3 + \varepsilon_2$$

Consequently, expression (XI.85) can be written as

$$C_1 = \frac{\partial(a_3H_3)}{\partial q_2}\,\Delta q_2\,\Delta q_3 - \frac{\partial(a_2H_2)}{\partial q_3}\,\Delta q_3\,\Delta q_2 + \varepsilon = \frac{1}{H_2H_3}\Big\{\frac{\partial(a_3H_3)}{\partial q_2} - \frac{\partial(a_2H_2)}{\partial q_3}\Big\}\,f_1 + \varepsilon$$

where $f_1$ is the area of the rectangle [see (XI.74)] and $\varepsilon$ is a quantity
of a higher order of smallness than $f_1$.
Introducing the value of $C_1$ we have found into (XI.84) and
performing a limiting process, we arrive at the formula

$$[\nabla\mathbf{a}]_1 = \frac{1}{H_2H_3}\Big(\frac{\partial(a_3H_3)}{\partial q_2} - \frac{\partial(a_2H_2)}{\partial q_3}\Big)$$

Similar formulas are obtained for the two other projections of the
vector $[\nabla\mathbf{a}]$. All three formulas can be combined into a single one:

$$[\nabla\mathbf{a}]_i = \frac{1}{H_kH_l}\Big\{\frac{\partial(a_lH_l)}{\partial q_k} - \frac{\partial(a_kH_k)}{\partial q_l}\Big\} \quad (i = 1, 2, 3) \tag{XI.86}$$

(the subscripts $i, k, l$ form a cyclic transposition of the sequence 1,
2, 3).

Finally, let us find an expression for the Laplacian operator
in curvilinear coordinates. By (XI.41), we have $\Delta\psi = \nabla(\nabla\psi) = \operatorname{div}\operatorname{grad}\psi$.

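Using formulas (XI.81) and (XI.77), we obtain

$$\Delta\psi = \nabla(\nabla\psi) = \frac{1}{H_1H_2H_3}\Big\{\frac{\partial}{\partial q_1}\Big(\frac{H_2H_3}{H_1}\frac{\partial\psi}{\partial q_1}\Big) + \frac{\partial}{\partial q_2}\Big(\frac{H_3H_1}{H_2}\frac{\partial\psi}{\partial q_2}\Big) + \frac{\partial}{\partial q_3}\Big(\frac{H_1H_2}{H_3}\frac{\partial\psi}{\partial q_3}\Big)\Big\} \tag{XI.87}$$

Introducing the values (XI.69), we obtain an expression for the
Laplacian operator in spherical coordinates

$$\Delta\psi = \frac{1}{r^2}\frac{\partial}{\partial r}\Big(r^2\frac{\partial\psi}{\partial r}\Big) + \frac{1}{r^2\sin\vartheta}\frac{\partial}{\partial\vartheta}\Big(\sin\vartheta\,\frac{\partial\psi}{\partial\vartheta}\Big) + \frac{1}{r^2\sin^2\vartheta}\frac{\partial^2\psi}{\partial\varphi^2} \tag{XI.88}$$

In cylindrical coordinates

$$\Delta\psi = \frac{1}{\rho}\frac{\partial}{\partial\rho}\Big(\rho\,\frac{\partial\psi}{\partial\rho}\Big) + \frac{1}{\rho^2}\frac{\partial^2\psi}{\partial\varphi^2} + \frac{\partial^2\psi}{\partial z^2}$$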
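
The general formula (XI.87) can be tested against the Cartesian Laplacian (XI.39). The sketch below is illustrative: it assumes spherical coordinates with the Lamé coefficients (XI.69), a hypothetical polynomial test function `psi_cart` whose Cartesian Laplacian is known by hand, and nested central differences with step `H`.

```python
import math

H = 1e-4  # finite-difference step; an illustrative choice

def to_cartesian(q):
    r, th, ph = q
    return (r * math.sin(th) * math.cos(ph),
            r * math.sin(th) * math.sin(ph),
            r * math.cos(th))

def lame(q):
    """Lame coefficients of the spherical system, values (XI.69)."""
    r, th, ph = q
    return (1.0, r, r * math.sin(th))

def psi_cart(x):
    # Hypothetical test function; its Cartesian Laplacian is 2*y + 6*z.
    return x[0] ** 2 * x[1] + x[2] ** 3

def psi(q):
    return psi_cart(to_cartesian(q))

def dq(f, q, i):
    """Central-difference derivative of f with respect to q_i."""
    qp, qm = list(q), list(q)
    qp[i] += H
    qm[i] -= H
    return (f(qp) - f(qm)) / (2 * H)

def laplacian_curvilinear(f, q):
    """Evaluate the right-hand side of (XI.87) numerically."""
    total = 0.0
    for i in range(3):
        def flux(qq, i=i):
            h = lame(qq)
            # (H1 H2 H3 / H_i^2) * d f / d q_i, i.e. the bracket in (XI.87)
            return (h[0] * h[1] * h[2] / h[i] ** 2) * dq(f, qq, i)
        total += dq(flux, q, i)
    h = lame(q)
    return total / (h[0] * h[1] * h[2])

q = (1.3, 0.7, 0.4)
x = to_cartesian(q)
value = laplacian_curvilinear(psi, q)
expected = 2 * x[1] + 6 * x[2]
print(value, expected)
```

Replacing `lame` and `to_cartesian` by their cylindrical counterparts checks the cylindrical expression in the same way.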


XII. Four-Dimensional Vectors and Tensors in Pseudo-Euclidean Space

In Appendices VI and X, we considered vectors and tensors in
Euclidean space in which the square of the position vector is
determined by the expression

$$x_1^2 + x_2^2 + \dots + x_n^2 = \sum_{i=1}^n x_i^2 \tag{XII.1}$$

If we introduce the tensor

$$g_{ik} = \begin{pmatrix} 1 & 0 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ \dots & \dots & \dots & \dots \\ 0 & 0 & \dots & 1 \end{pmatrix} \tag{XII.2}$$

expression (XII.1) can be written as

$$\sum_{i,k=1}^n g_{ik}\,x_i x_k \tag{XII.3}$$

The tensor $g_{ik}$ is called a metric tensor.

The equations of the special theory of relativity and
electrodynamics acquire an especially simple and clear nature if they are
written as relations between vectors and tensors in four-dimensional
space whose metric is determined by the tensor

$$g_{\mu\nu} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix} \tag{XII.4}$$

For reasons that will be revealed later on, we shall use the following
notation. We shall write the indices on the components of a
four-position vector not as subscripts, but as superscripts. We shall
write the Kronecker symbol as $\delta^\mu_\nu$, i.e. with one subscript and one
superscript. We shall use superscripts or subscripts on the
components of the metric tensor depending on the circumstances.

We shall adhere to the following rules in writing formulas: 

1. In each pair of dummy indices (i.e. indices over which sum- 
mation is to be performed), one will be a superscript and the other 
a subscript. 

2. Free indices (i.e. those over which summation is not performed)
will be placed in the same position, either as a superscript or as a
subscript, on both sides of an equation.

In the coefficients of the linear transformation of vector and tensor
components, we shall also make one index a superscript and the
other a subscript¹. Consequently, unlike the transformation formulas
$a_i' = \sum_k \alpha_{ik}a_k$ which we had to do with in Appendices VI and X,
we shall write similar formulas as follows: $a'^\mu = \sum_\nu \alpha^\mu_\nu a^\nu$.

Recall that in accordance with the condition we have adopted,
whenever the symbol $\sum$ is absent, summation even over paired
indices is not performed.

A space whose properties are determined by the tensor (XII.4)
is called a pseudo-Euclidean one. We shall distinguish the coordinates,
and also vector and tensor components in this four-space with the
aid of the Greek indices $\mu$, $\nu$, etc., that can take on the four values
0, 1, 2, and 3. We shall use the Roman indices $i, k, \dots$ (running
through the values 1, 2, and 3) on coordinates and vector components
in conventional three-dimensional space (in space with a
Euclidean metric). We shall sometimes use Roman indices on the
components of four-vectors and four-tensors to underline that we
have in mind non-zero values of the indices.

Having determined the square of a position vector in the four-space
being considered by means of an expression similar to (XII.3)
and taking into account (XII.4), we obtain

$$\sum_{\mu,\nu=0}^3 g_{\mu\nu}\,x^\mu x^\nu = (x^0)^2 - (x^1)^2 - (x^2)^2 - (x^3)^2 \tag{XII.5}$$

In the theory of relativity, the product of the time $t$ and the speed
of light in a vacuum is taken as the coordinate $x^0$, and the coordinates
in conventional three-dimensional space are taken as the remaining
coordinates $x^i$. Hence,

$$x^0 = ct, \quad x^1 = x, \quad x^2 = y, \quad x^3 = z \tag{XII.6}$$

Having this in view, we can write expression (XII.5) as

$$\sum_{\mu,\nu=0}^3 g_{\mu\nu}\,x^\mu x^\nu = (x^0)^2 - \sum_{i=1}^3 (x^i)^2 = c^2t^2 - r^2 \tag{XII.7}$$

Formulas for the Transformation of Coordinates. In a transition
to another coordinate system, the components of a four-position
vector are transformed according to the linear law:

$$x'^\mu = \sum_{\nu=0}^3 \alpha^\mu_\nu\,x^\nu \tag{XII.8}$$

¹ We shall see in the following that one index is a superscript and the other
one is a subscript on mixed components of tensors. It must be remembered,
however, that linear transformation coefficients do not have the properties of
tensor components.

The inverse transformation is performed by the formula

$$x^\mu = \sum_{\nu=0}^3 \bar\alpha^\mu_\nu\,x'^\nu \tag{XII.9}$$

where $\bar\alpha^\mu_\nu$ are the coefficients of the inverse transformation.

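Owing to the invariance of the square of the four-position vector,
the condition must be observed that
$\sum_{\mu,\nu=0}^3 g_{\mu\nu}\,x^\mu x^\nu = \sum_{\mu,\nu=0}^3 g_{\mu\nu}\,x'^\mu x'^\nu$.
Substituting for $x'^\mu$ and $x'^\nu$ in this expression their values from
(XII.8), we obtain

$$\sum_{\mu,\nu=0}^3 g_{\mu\nu}\,x^\mu x^\nu = \sum_{\mu,\nu=0}^3 g_{\mu\nu} \sum_{\rho=0}^3 \alpha^\mu_\rho\,x^\rho \sum_{\sigma=0}^3 \alpha^\nu_\sigma\,x^\sigma$$

Let us exchange on the right the places of the dummy indices $\mu$
and $\rho$, and also of $\nu$ and $\sigma$. The result is

$$\sum_{\mu,\nu=0}^3 g_{\mu\nu}\,x^\mu x^\nu = \sum_{\mu,\nu=0}^3 x^\mu x^\nu \sum_{\rho,\sigma=0}^3 g_{\rho\sigma}\,\alpha^\rho_\mu\,\alpha^\sigma_\nu$$

This gives us the conditions which the coefficients of the linear
transformation (XII.8) must satisfy:

$$\sum_{\rho,\sigma=0}^3 g_{\rho\sigma}\,\alpha^\rho_\mu\,\alpha^\sigma_\nu = g_{\mu\nu}$$

Having in view that $g_{\rho\sigma}$ differ from zero only when $\rho = \sigma$ [see
(XII.4)], this relation can be simplified as follows:

$$\sum_{\rho=0}^3 g_{\rho\rho}\,\alpha^\rho_\mu\,\alpha^\rho_\nu = g_{\mu\nu} \quad (\mu, \nu = 0, 1, 2, 3) \tag{XII.10}$$

For example, for $\mu = \nu = 0$, we obtain by this formula

$$(\alpha^0_0)^2 - (\alpha^1_0)^2 - (\alpha^2_0)^2 - (\alpha^3_0)^2 = 1 \tag{XII.11}$$

and for $\mu = 1$ and $\nu = 2$:

$$\alpha^0_1\alpha^0_2 - \alpha^1_1\alpha^1_2 - \alpha^2_1\alpha^2_2 - \alpha^3_1\alpha^3_2 = 0 \tag{XII.12}$$

We must note that if we took the tensor (XII.2) as $g_{\rho\sigma}$, we could
reduce formula (XII.10) to the form $\sum_\rho \alpha_{\rho\mu}\,\alpha_{\rho\nu} = \delta_{\mu\nu}$, which
coincides with formula (VI.40).

It is obvious that relations similar to (XII.10) also hold for the
coefficients $\bar\alpha^\mu_\nu$.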

Now let us find the relation between the coefficients of the direct
and inverse transformations. Remember that in Euclidean space,
this relation is $\bar\alpha_{ik} = \alpha_{ki}$ [compare formulas (VI.37) and (VI.38)].
Let us consecutively apply the transformations (XII.9) and
(XII.8). The result is

$$x^\mu = \sum_{\rho=0}^3 \bar\alpha^\mu_\rho\,x'^\rho = \sum_{\rho=0}^3 \bar\alpha^\mu_\rho \sum_{\nu=0}^3 \alpha^\rho_\nu\,x^\nu = \sum_{\nu=0}^3 x^\nu \sum_{\rho=0}^3 \bar\alpha^\mu_\rho\,\alpha^\rho_\nu$$

The component $x^\mu$ can be written as $\sum_\nu \delta^\mu_\nu\,x^\nu$. It thus follows that

$$\sum_{\rho=0}^3 \bar\alpha^\mu_\rho\,\alpha^\rho_\nu = \delta^\mu_\nu \quad (\mu, \nu = 0, 1, 2, 3) \tag{XII.13}$$

It is not difficult to see that the system of equations (XII.13)
will be satisfied if we assume that

$$\bar\alpha^0_0 = \alpha^0_0, \quad \bar\alpha^0_i = -\alpha^i_0, \quad \bar\alpha^i_0 = -\alpha^0_i, \quad \bar\alpha^i_k = \alpha^k_i \quad (i, k = 1, 2, 3) \tag{XII.14}$$

Indeed, let $\mu = \nu = 0$. Equation (XII.13) therefore becomes

$$\sum_{\rho=0}^3 \bar\alpha^0_\rho\,\alpha^\rho_0 = 1 \quad \text{or, with account of (XII.14),} \quad \alpha^0_0\,\alpha^0_0 - \sum_{i=1}^3 \alpha^i_0\,\alpha^i_0 = 1$$

which agrees with (XII.11).

Assume that $\mu = 1$ and $\nu = 2$. Equation (XII.13) now becomes

$$\sum_{\rho=0}^3 \bar\alpha^1_\rho\,\alpha^\rho_2 = 0 \quad \text{or, with account of (XII.14),} \quad -\alpha^0_1\,\alpha^0_2 + \sum_{i=1}^3 \alpha^i_1\,\alpha^i_2 = 0$$

which agrees with (XII.12). We can convince ourselves similarly
that the remaining 14 equations contained in (XII.13) are satisfied
by the solution (XII.14).

We must note that relations (XII.14) can be written as a single
relation

$$\bar\alpha^\mu_\nu\,g_{\mu\mu} = \alpha^\nu_\mu\,g_{\nu\nu} \tag{XII.15}$$

Indeed, when $\mu = \nu = 0$, we have $g_{\mu\mu} = g_{\nu\nu}$, therefore $\bar\alpha^0_0 = \alpha^0_0$;
when $\mu = 0$ and $\nu = i \neq 0$, as well as when $\mu = i \neq 0$ and $\nu = 0$,
we have $g_{\mu\mu} = -g_{\nu\nu}$, owing to which $\bar\alpha^0_i = -\alpha^i_0$ and $\bar\alpha^i_0 = -\alpha^0_i$;
finally, when $\mu = i \neq 0$ and $\nu = k \neq 0$, both factors ($g_{\mu\mu}$ and $g_{\nu\nu}$)
equal $-1$, so that $\bar\alpha^i_k = \alpha^k_i$. As a result, we have arrived at relations
(XII.14).

The theory of relativity usually deals with transformations in
which the coordinates $x^2 = y$ and $x^3 = z$ remain unchanged ($x'^2 = x^2$
and $x'^3 = x^3$). The matrix of the transformation coefficients
in this case can evidently be written as follows:

$$[\alpha^\mu_\nu] = \begin{pmatrix} \alpha^0_0 & \alpha^0_1 & 0 & 0 \\ \alpha^1_0 & \alpha^1_1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \tag{XII.16}$$

The matrix of the inverse transformation coefficients has a similar
appearance.

Let us designate by $p$ a parameter that can be used to characterize
the difference between the systems $K$ and $K'$. It can be, for example,
the angle of rotation of one system relative to the other. In the theory
of relativity, the role of the parameter $p$ is played by the relative
velocity of the systems $K$ and $K'$. It is evident that as a result of a
limiting process in which $p$ shrinks to zero, both systems coincide,
so that the matrix (XII.16) is

$$\begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}$$

Hence,

$$\lim_{p\to 0}\alpha^0_0 = \lim_{p\to 0}\alpha^1_1 = 1, \qquad \lim_{p\to 0}\alpha^0_1 = \lim_{p\to 0}\alpha^1_0 = 0 \tag{XII.17}$$

The four coefficients $\alpha^\mu_\nu$ differing from zero and unity are not
independent. Having written relation (XII.10) for $\mu = \nu = 0$, we find
that

$$(\alpha^0_0)^2 - (\alpha^1_0)^2 = 1 \tag{XII.18}$$

A similar relation also holds for the coefficients of the inverse
transformation: $(\bar\alpha^0_0)^2 - (\bar\alpha^1_0)^2 = 1$, whence with a view to (XII.14)
we obtain the equation

$$(\alpha^0_0)^2 - (\alpha^0_1)^2 = 1 \tag{XII.19}$$

Now let us write condition (XII.10) for $\mu = 0$ and $\nu = 1$:

$$\alpha^0_0\,\alpha^0_1 - \alpha^1_0\,\alpha^1_1 = 0 \tag{XII.20}$$

A comparison of Eqs. (XII.18) and (XII.19) shows that
$(\alpha^1_0)^2 = (\alpha^0_1)^2$, i.e. $\alpha^1_0 = \pm\alpha^0_1$. A comparison of this condition with
Eq. (XII.20) gives two possible alternatives of the relations between
the coefficients:

(1) $\alpha^0_0 = \alpha^1_1$ if $\alpha^1_0 = \alpha^0_1$, and (2) $\alpha^0_0 = -\alpha^1_1$ if $\alpha^1_0 = -\alpha^0_1$

The first alternative agrees with the property of the coefficients
expressed by formula (XII.17). Consequently, we must assume that
$\alpha^0_0 = \alpha^1_1$, and $\alpha^1_0 = \alpha^0_1$. Introducing the notation $\alpha^0_0 = \alpha^1_1 = \alpha_0$
and $\alpha^0_1 = \alpha^1_0 = \alpha_1$, we can write the matrix of the transformation
coefficients as follows:

$$[\alpha^\mu_\nu] = \begin{pmatrix} \alpha_0 & \alpha_1 & 0 & 0 \\ \alpha_1 & \alpha_0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \tag{XII.21}$$

This matrix contains only one independent coefficient (whose value
is determined by the specific form of the transformation) because
$\alpha_0 = \alpha^0_0$ and $\alpha_1 = \alpha^0_1$ are related by Eq. (XII.19):

$$\alpha_0^2 - \alpha_1^2 = 1 \tag{XII.22}$$

By (XII.14), $\bar\alpha^0_0 = \bar\alpha^1_1 = \alpha_0$, and $\bar\alpha^0_1 = \bar\alpha^1_0 = -\alpha_1$. Therefore,
the matrix of the inverse transformation is

$$[\bar\alpha^\mu_\nu] = \begin{pmatrix} \alpha_0 & -\alpha_1 & 0 & 0 \\ -\alpha_1 & \alpha_0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix} \tag{XII.23}$$
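
The relations between the matrices (XII.21) and (XII.23) can be verified with a few lines of matrix arithmetic. The sketch below is an illustration, not part of the derivation: the parametrization of the single independent coefficient by a hyperbolic angle `chi` is an assumption introduced here only because it satisfies (XII.22) automatically.

```python
import math

# Metric tensor (XII.4).
G = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def transpose(A):
    return [list(row) for row in zip(*A)]

def direct(alpha0, alpha1):
    """Matrix (XII.21); the row index is the superscript, the column index the subscript."""
    return [[alpha0, alpha1, 0, 0],
            [alpha1, alpha0, 0, 0],
            [0, 0, 1, 0],
            [0, 0, 0, 1]]

def inverse(alpha0, alpha1):
    """Matrix (XII.23): the same form with the sign of alpha1 reversed."""
    return direct(alpha0, -alpha1)

# Assumed parametrization (not from the text): alpha0 = cosh(chi), alpha1 = -sinh(chi).
chi = 0.75
alpha0, alpha1 = math.cosh(chi), -math.sinh(chi)

A, A_inv = direct(alpha0, alpha1), inverse(alpha0, alpha1)
product = matmul(A_inv, A)                          # unit matrix expected, cf. (XII.13)
metric_check = matmul(transpose(A), matmul(G, A))   # G expected, cf. (XII.10)
print(product)
print(metric_check)
```

Any pair of numbers violating (XII.22) makes both checks fail, which shows that the condition is not merely sufficient but is what keeps the square of the four-position vector invariant.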


Contravariant and Covariant Vectors. Substitution into (XII.8)
of the values of $\alpha^\mu_\nu$ from (XII.21) leads to formulas for the
transformation of the components of a four-position vector:

$$x'^0 = \alpha_0x^0 + \alpha_1x^1, \quad x'^1 = \alpha_1x^0 + \alpha_0x^1, \quad x'^2 = x^2, \quad x'^3 = x^3 \tag{XII.24}$$

The set of the four quantities $a^0, a^1, a^2$, and $a^3$ that transform in
a transition from one coordinate system to another according to
the same rules as the components of a four-position vector, i.e.
according to the formulas

$$a'^\mu = \sum_{\nu=0}^3 \alpha^\mu_\nu\,a^\nu \tag{XII.25}$$

is said to be a four-dimensional vector (a four-vector). The
coefficients in (XII.25) have the same values as in (XII.8).

The component $a^0$ corresponds to the component $x^0$ of a
four-position vector that in the theory of relativity is taken equal to $ct$
[see (XII.6)]. In this connection, the component $a^0$ is called the time
component. The components $a^1, a^2, a^3$ correspond to the components $x^1 = x$,
$x^2 = y$, $x^3 = z$, and are therefore called spatial components.

Substitution into (XII.25) of the values of $\alpha^\mu_\nu$ from (XII.21) leads
to the following formulas:

$$a'^0 = \alpha_0a^0 + \alpha_1a^1, \quad a'^1 = \alpha_1a^0 + \alpha_0a^1, \quad a'^2 = a^2, \quad a'^3 = a^3 \tag{XII.26}$$

The square of a four-vector is determined by analogy with the
square of a four-position vector [see (XII.5)]:

$$\sum_{\mu,\nu=0}^3 g_{\mu\nu}\,a^\mu a^\nu = (a^0)^2 - (a^1)^2 - (a^2)^2 - (a^3)^2 \tag{XII.27}$$

We must note that the square of a four-vector can be either positive
or negative; particularly, it may be zero.

In writing vector formulas for a pseudo-Euclidean space, we become
confronted with a major inconvenience because the square of a
four-vector is determined by expression (XII.27) that cannot be written
in a compact form as $\sum_\mu (a^\mu)^2$. This inconvenience is eliminated by
introducing two kinds of four-vectors. They are distinguished by
using a superscript on vectors of one kind and a subscript on vectors
of the other kind. The first kind of vectors are called contravariant,
and the second, covariant. Hence, by using a superscript on the
components of a four-position vector, we have related this vector
to the category of contravariant ones.

A covariant vector $a_\mu$ corresponds to each contravariant vector $a^\mu$
(and vice versa), and it is assumed that

$$a_\mu = g_{\mu\mu}\,a^\mu \tag{XII.28}$$

[see (XII.4)], i.e.

$$a_0 = a^0, \quad a_1 = -a^1, \quad a_2 = -a^2, \quad a_3 = -a^3 \tag{XII.29}$$

It is not difficult to see that relation (XII.28) can be written as

$$a^\mu = g^{\mu\mu}\,a_\mu \tag{XII.30}$$

Consequently, raising or lowering of the index $\mu$ on the component
of a four-vector is attended by multiplication of the component by
$g^{\mu\mu}$ or $g_{\mu\mu}$.

When using contravariant and covariant components, the square
of a four-vector [see (XII.27)] can be written as follows:

$$\sum_{\mu=0}^{3} a^{\mu} a_{\mu} = a^0 a_0 + a^1 a_1 + a^2 a_2 + a^3 a_3 \tag{XII.31}$$

By analogy with formula (XII. 31), which gives the product of 
two identical four-vectors, we can determine the scalar product of 
two different four-vectors: 

$$\sum_{\mu=0}^{3} a^{\mu} b_{\mu} = a^0 b_0 + a^1 b_1 + a^2 b_2 + a^3 b_3 = a^0 b^0 - a^1 b^1 - a^2 b^2 - a^3 b^3 \tag{XII.32}$$

The following expression is obvious:

$$\sum_{\mu=0}^{3} a^{\mu} b_{\mu} = \sum_{\mu=0}^{3} a_{\mu} b^{\mu} \tag{XII.33}$$

In general, in any pair of dummy indices we may exchange the places
of the superscript and subscript.

It must be noted that in purely spatial rotations (i.e. transforma-
tions not affecting the component $a^0$), the three spatial components
of the four-vector $a^{\mu}$ behave like the components of a vector in three-
dimensional Euclidean space (the component $a^0$ behaves here like
a three-dimensional scalar). In this connection, the components
of a four-vector can be written as

$$a^{\mu} = (a^0, a^i) \quad (i = 1, 2, 3) \tag{XII.34}$$

or, more briefly,

$$a^{\mu} = (a^0, \mathbf{a}) \tag{XII.35}$$

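The covariant components of the same vector are as follows:

$$a_{\mu} = (a^0, -a^i) \quad (i = 1, 2, 3) \tag{XII.36}$$

or

$$a_{\mu} = (a^0, -\mathbf{a}) \tag{XII.37}$$

The square of a four-vector can be expressed as

$$\sum_{\mu} a^{\mu} a_{\mu} = (a^0)^2 - \sum_{i} (a^i)^2 = (a^0)^2 - \mathbf{a}^2 \tag{XII.38}$$

and the scalar product of four-vectors as

$$\sum_{\mu} a^{\mu} b_{\mu} = a^0 b^0 - \mathbf{a}\mathbf{b} \tag{XII.39}$$

It is a simple matter to see that the increment of the square of
a four-vector can be written in two ways:

$$\delta \sum_{\mu} a^{\mu} a_{\mu} = 2 \sum_{\mu} a^{\mu}\, \delta a_{\mu} = 2 \sum_{\mu} a_{\mu}\, \delta a^{\mu} \tag{XII.40}$$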
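The index rules (XII.29), (XII.33), (XII.38) and (XII.39) can be illustrated with a few lines of code. This is only a sketch: the names `lower`, `contract` and `scalar_product` are hypothetical helpers, and the metric is taken to be the diagonal (1, -1, -1, -1) of (XII.4).

```python
# Diagonal elements g_{mu mu} of the metric, assumed to be (1, -1, -1, -1).
G = (1.0, -1.0, -1.0, -1.0)


def lower(a_up):
    """Covariant components from contravariant ones, rule (XII.29)."""
    return [g * c for g, c in zip(G, a_up)]


def contract(u, v):
    """Plain sum over mu of u[mu] * v[mu] (one index up, one index down)."""
    return sum(x * y for x, y in zip(u, v))


def scalar_product(a_up, b_up):
    """Scalar product (XII.32) of two four-vectors given by a^mu and b^mu."""
    return contract(a_up, lower(b_up))


if __name__ == "__main__":
    a = [3.0, 1.0, 2.0, 0.0]
    b = [1.0, 4.0, 0.0, 2.0]
    print(scalar_product(a, b))                          # a^0 b^0 - a.b
    print(contract(a, lower(b)), contract(lower(a), b))  # equal, see (XII.33)
    print(scalar_product(a, a))                          # (a^0)^2 - a^2
```

Because the metric is diagonal, lowering and raising an index use the same multiplication, so `lower` serves for both directions.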

Four-Dimensional Gradient. Assume that we are given a scalar
function of the quantities $x^{\mu}$:

$$\varphi(x^0, x^1, x^2, x^3)$$

According to the rules of differential calculus, the increment of
this function is given by the expression

$$d\varphi = \frac{\partial\varphi}{\partial x^0}\,dx^0 + \frac{\partial\varphi}{\partial x^1}\,dx^1 + \frac{\partial\varphi}{\partial x^2}\,dx^2 + \frac{\partial\varphi}{\partial x^3}\,dx^3$$

The increment of a function cannot depend on the coordinate
system it is being calculated in, i.e. it is an invariant. We thus con-
clude that the quantities

$$\frac{\partial\varphi}{\partial x^0}, \quad \frac{\partial\varphi}{\partial x^1}, \quad \frac{\partial\varphi}{\partial x^2}, \quad \frac{\partial\varphi}{\partial x^3} \tag{XII.41}$$

behave like covariant components of a vector in transformations
of the coordinates. This vector is called the four-gradient of the
function $\varphi$.
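A small numerical illustration may help here. For the invariant function $\varphi = \sum_{\mu} x^{\mu} x_{\mu}$, the partial derivatives (XII.41) should equal $2x_{\mu}$, i.e. twice the covariant components of the position vector. The sketch below checks this with central differences; the choice of $\varphi$, the step `h` and the function names are assumptions made for the example.

```python
def phi(x):
    """Invariant scalar function: the square of the four-position vector."""
    return x[0] ** 2 - x[1] ** 2 - x[2] ** 2 - x[3] ** 2


def four_gradient(f, x, h=1e-6):
    """Central-difference estimate of the quantities (XII.41) at the point x."""
    grad = []
    for mu in range(4):
        x_plus, x_minus = list(x), list(x)
        x_plus[mu] += h
        x_minus[mu] -= h
        grad.append((f(x_plus) - f(x_minus)) / (2.0 * h))
    return grad


if __name__ == "__main__":
    point = [1.0, 2.0, 3.0, 4.0]
    # Expected: close to 2 * (x^0, -x^1, -x^2, -x^3), i.e. 2 * x_mu.
    print(four_gradient(phi, point))
```

The spatial derivatives come out with the opposite sign to the contravariant components, which is the signature of a covariant vector.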

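If we want to introduce the four-operator of the gradient (the oper-
ator $\nabla^*$), its covariant components must be determined as follows:

$$\nabla^*_0 = \frac{\partial}{\partial x^0}, \quad \nabla^*_i = \frac{\partial}{\partial x^i} \quad (i = 1, 2, 3) \tag{XII.42}$$

or

$$\nabla^*_{\mu} = \left( \frac{1}{c}\frac{\partial}{\partial t},\ \nabla \right) \tag{XII.43}$$

where $\nabla$ is the operator of the gradient in three-dimensional Eucli-
dean space. Consequently, the contravariant components of the
operator $\nabla^*$ will be

$$\nabla^{*0} = \frac{\partial}{\partial x^0}, \quad \nabla^{*i} = -\frac{\partial}{\partial x^i} \quad (i = 1, 2, 3) \tag{XII.44}$$

which can be written as follows:

$$\nabla^{*\mu} = \left( \frac{1}{c}\frac{\partial}{\partial t},\ -\nabla \right) \tag{XII.45}$$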

We are now able to explain the essence of the difference between
contravariant and covariant vectors. By (XII.8), the derivative of
$x'^{\mu}$ with respect to $x^{\nu}$ is $\alpha^{\mu}_{\nu}$:

$$\alpha^{\mu}_{\nu} = \frac{\partial x'^{\mu}}{\partial x^{\nu}} \tag{XII.46}$$

Introducing these values of $\alpha^{\mu}_{\nu}$ into (XII.8), we can give the for-
mula for transforming the contravariant components of a four-
vector the form

$$a'^{\mu} = \sum_{\nu=0}^{3} \frac{\partial x'^{\mu}}{\partial x^{\nu}}\, a^{\nu} \tag{XII.47}$$

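Considering a scalar function $\varphi$ as a composite function of the
form

$$\varphi = \varphi\,[x^0(x'^0, x'^1, x'^2, x'^3),\ x^1(x'^0, x'^1, x'^2, x'^3),\ \ldots]$$

we can write that

$$\frac{\partial\varphi}{\partial x'^{\mu}} = \sum_{\nu=0}^{3} \frac{\partial\varphi}{\partial x^{\nu}}\, \frac{\partial x^{\nu}}{\partial x'^{\mu}}$$

By (XII.41), $\partial\varphi/\partial x'^{\mu}$ is the $\mu$-th covariant component of the gradient
of $\varphi$ calculated for the system $K'$ [let us designate it by $(\nabla^*\varphi)'_{\mu}$],
and $\partial\varphi/\partial x^{\nu}$ is the $\nu$-th covariant component of $\nabla^*\varphi$ calculated for
the system $K$ [let us designate it by $(\nabla^*\varphi)_{\nu}$]. We can therefore write

$$(\nabla^*\varphi)'_{\mu} = \sum_{\nu=0}^{3} \frac{\partial x^{\nu}}{\partial x'^{\mu}}\, (\nabla^*\varphi)_{\nu} \tag{XII.48}$$

A comparison of transformations (XII.47) and (XII.48) shows that
they are not identical. The role of the coefficients in one of them
is played by the quantities $\partial x'^{\mu}/\partial x^{\nu}$, and in the other by the quanti-
ties $\partial x^{\nu}/\partial x'^{\mu}$.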

Hence, vectors are said to be contravariant if their components
transform according to the law (XII.47), i.e. like the components of
the vectors $x^{\mu}$ or $dx^{\mu}$ (like the differentials of the coordinates).
Vectors are said to be covariant if their components transform accord-
ing to the law (XII.48), i.e. like the components of a gradient (like
the partial derivatives with respect to the coordinates).

We have seen above that if we are given a contravariant vector $a^{\mu}$,
we can always use the rule (XII.29) to introduce the covariant vector
corresponding to it (and vice versa). Since the quantities $a^{\mu}$ and $a_{\mu}$
are close relatives, we shall treat them not as components of two
different vectors, but as the contravariant and covariant components
of the same vector.

For the Euclidean metric, the inverse transformation coefficients
$\tilde{\alpha}^{\nu}_{\mu}$ are related to the direct transformation coefficients $\alpha^{\mu}_{\nu}$ by the
expression $\tilde{\alpha}^{\nu}_{\mu} = \alpha^{\mu}_{\nu}$. Consequently, the direct and inverse trans-
formations of coordinates have the form

$$x'^{\mu} = \sum_{\nu} \alpha^{\mu}_{\nu}\, x^{\nu}, \qquad x^{\nu} = \sum_{\mu} \alpha^{\mu}_{\nu}\, x'^{\mu}$$

whence

$$\frac{\partial x'^{\mu}}{\partial x^{\nu}} = \frac{\partial x^{\nu}}{\partial x'^{\mu}}$$

Hence, when a space has a Euclidean metric, the coefficients of the
transformations (XII.47) and (XII.48) coincide, and the difference
between the contravariant and covariant vectors (and also the
tensors) vanishes.
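The contrast between the two metrics can be seen numerically. For a Euclidean rotation the inverse matrix is the transpose, so the two sets of coefficients coincide; for a Lorentz boost this is no longer true. The sketch below assumes the usual boost coefficients $\alpha_0 = \gamma$, $\alpha_1 = -\beta\gamma$ in the $(x^0, x^1)$ plane; all helper names are illustrative.

```python
import math


def rotation_z(theta):
    """3x3 rotation about the z axis in Euclidean space."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s, 0.0], [-s, c, 0.0], [0.0, 0.0, 1.0]]


def boost_2d(beta):
    """2x2 block of a boost acting on (x^0, x^1); coefficients are assumed."""
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return [[gamma, -beta * gamma], [-beta * gamma, gamma]]


def transpose(m):
    return [list(row) for row in zip(*m)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def is_identity(m, tol=1e-12):
    return all(abs(m[i][j] - (1.0 if i == j else 0.0)) < tol
               for i in range(len(m)) for j in range(len(m)))


if __name__ == "__main__":
    r = rotation_z(0.7)
    b = boost_2d(0.6)
    print(is_identity(matmul(transpose(r), r)))  # transpose inverts a rotation
    print(is_identity(matmul(transpose(b), b)))  # but it does not invert a boost
```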

Transformation of Covariant Components. In a transition from the
system $K$ to the system $K'$, the covariant components of vectors
obviously transform, like their contravariant counterparts, according
to a linear law. Designating the coefficients of covariant component
transformation by the symbol $\alpha_{\mu}^{\ \nu}$, we can write

$$a'_{\mu} = \sum_{\nu=0}^{3} \alpha_{\mu}^{\ \nu}\, a_{\nu} \tag{XII.49}$$

Let us find the relation between the coefficients of transformations
of contravariant and covariant components, i.e. between the quanti-
ties $\alpha^{\mu}_{\nu}$ and $\alpha_{\mu}^{\ \nu}$. Raising or lowering of the index $\mu$ (or $\nu$) on the com-
ponent of a vector is attended by the multiplication of this compo-
nent by $g_{\mu\mu}$ (or $g_{\nu\nu}$). Expression (XII.49) can therefore be transformed
as follows:

$$g_{\mu\mu}\, a'^{\mu} = \sum_{\nu=0}^{3} (\alpha_{\mu}^{\ \nu} g_{\nu\nu})\, a^{\nu} \tag{XII.50}$$

At the same time for contravariant components, we have

$$a'^{\mu} = \sum_{\nu=0}^{3} \alpha^{\mu}_{\nu}\, a^{\nu}$$

Multiplying the left-hand and right-hand sides by $g_{\mu\mu}$, we obtain

$$g_{\mu\mu}\, a'^{\mu} = \sum_{\nu=0}^{3} (g_{\mu\mu}\, \alpha^{\mu}_{\nu})\, a^{\nu} \tag{XII.51}$$

A comparison of formulas (XII.50) and (XII.51) shows that

$$\alpha_{\mu}^{\ \nu}\, g_{\nu\nu} = \alpha^{\mu}_{\nu}\, g_{\mu\mu} \tag{XII.52}$$

With a view to (XII.4), it is simple to obtain that

$$\alpha_0^{\ 0} = \alpha^0_0, \quad \alpha_0^{\ i} = -\alpha^0_i, \quad \alpha_i^{\ 0} = -\alpha^i_0, \quad \alpha_i^{\ k} = \alpha^i_k \quad (i, k = 1, 2, 3) \tag{XII.53}$$

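A comparison of relations (XII.15) and (XII.52) allows us to
write

$$\alpha_{\mu}^{\ \nu} = \tilde{\alpha}^{\nu}_{\mu} \tag{XII.54}$$

where $\tilde{\alpha}^{\nu}_{\mu}$ are the coefficients of inverse transformation of the contra-
variant components.

We saw above that $\alpha^{\nu}_{\mu} = \partial x'^{\nu}/\partial x^{\mu}$ [see (XII.46)]. It is evident
that

$$\tilde{\alpha}^{\nu}_{\mu} = \frac{\partial x^{\nu}}{\partial x'^{\mu}}$$

Introducing this expression into (XII.54), we obtain coefficients
for the direct transformation of the covariant components

$$\alpha_{\mu}^{\ \nu} = \frac{\partial x^{\nu}}{\partial x'^{\mu}}$$

This agrees with (XII.48).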
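Relations (XII.52) and (XII.54) lend themselves to a quick numerical check. The sketch below builds a direct transformation as a boost along $x$ followed after a spatial rotation about $z$, forms the covariant coefficients from (XII.52), and compares them with the transposed inverse transformation. The boost coefficients ($\alpha_0 = \gamma$, $\alpha_1 = -\beta\gamma$), the composite transformation and all function names are assumptions of the example.

```python
import math

# Diagonal elements g_{mu mu}; note that 1 / g_{mu mu} = g_{mu mu}.
G = (1.0, -1.0, -1.0, -1.0)


def boost_x(beta):
    """4x4 boost along x; assumes alpha_0 = gamma, alpha_1 = -beta * gamma."""
    g = 1.0 / math.sqrt(1.0 - beta * beta)
    return [[g, -beta * g, 0.0, 0.0],
            [-beta * g, g, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]


def rotation_z(theta):
    """4x4 purely spatial rotation about the z axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[1.0, 0.0, 0.0, 0.0],
            [0.0, c, s, 0.0],
            [0.0, -s, c, 0.0],
            [0.0, 0.0, 0.0, 1.0]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def covariant_coefficients(alpha):
    """Solve (XII.52) for the covariant coefficients:
    cov[mu][nu] = g_mumu * alpha[mu][nu] * g_nunu."""
    return [[G[m] * alpha[m][n] * G[n] for n in range(4)] for m in range(4)]


def max_deviation(beta=0.6, theta=0.3):
    """Largest violation of (XII.54), cov[mu][nu] == inverse[nu][mu]."""
    alpha = matmul(boost_x(beta), rotation_z(theta))       # direct
    inverse = matmul(rotation_z(-theta), boost_x(-beta))   # inverse
    cov = covariant_coefficients(alpha)
    return max(abs(cov[m][n] - inverse[n][m])
               for m in range(4) for n in range(4))


if __name__ == "__main__":
    print(max_deviation())  # expected to be at rounding-error level
```

A composite transformation is used on purpose: for a pure boost the coefficient matrix is symmetric, which would hide the transposition of indices in (XII.54).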

Four-Dimensional Tensors. By a four-tensor of the second rank
is meant a set of the 16 quantities $T^{\mu\nu}$ which in a transition from
one coordinate system to another are transformed by the formula

$$T'^{\mu\nu} = \sum_{\rho,\sigma=0}^{3} \alpha^{\mu}_{\rho}\, \alpha^{\nu}_{\sigma}\, T^{\rho\sigma} \tag{XII.55}$$

where $\alpha^{\mu}_{\rho}$ are the coefficients from (XII.21) [compare with (X.10)].
In the inverse transformation

$$T^{\mu\nu} = \sum_{\rho,\sigma=0}^{3} \tilde{\alpha}^{\mu}_{\rho}\, \tilde{\alpha}^{\nu}_{\sigma}\, T'^{\rho\sigma} \tag{XII.56}$$

A particular case of a four-tensor is the tensor with the com-
ponents

$$\Pi^{\mu\nu} = a^{\mu} b^{\nu} \tag{XII.57}$$

where $a^{\mu}$ and $b^{\nu}$ are components of four-vectors.

The components of a four-tensor can be represented in three
forms: as contravariant ones $T^{\mu\nu}$, covariant ones $T_{\mu\nu}$, and mixed
ones $T^{\mu}{}_{\nu}$. By analogy with (XII.57), raising or lowering of the time
index does not change the sign of a component, while raising or
lowering of a space index reverses the sign of a component [compare
with (XII.29)]. Raising or lowering of an even number of space
indices obviously leaves a component unchanged. Hence,

$$T_{00} = T^{00}, \quad T_{01} = -T^{01}, \quad T_{10} = -T^{10}, \quad T_{11} = T^{11}, \ \ldots$$

$$T^{0}{}_{0} = T_{0}{}^{0} = T^{00}, \quad T_{0}{}^{1} = T^{01}, \quad T^{0}{}_{1} = -T^{01}, \quad T^{1}{}_{0} = T^{10}, \quad T^{1}{}_{1} = T_{1}{}^{1} = -T^{11}, \ \ldots \tag{XII.58}$$


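Formulas for the transformation of the covariant components of
the tensor (XII.57) can be obtained by taking (XII.49) into account:

$$\Pi'_{\mu\nu} = a'_{\mu} b'_{\nu} = \sum_{\rho} \alpha_{\mu}^{\ \rho} a_{\rho} \sum_{\sigma} \alpha_{\nu}^{\ \sigma} b_{\sigma} = \sum_{\rho,\sigma} \alpha_{\mu}^{\ \rho} \alpha_{\nu}^{\ \sigma} a_{\rho} b_{\sigma} = \sum_{\rho,\sigma} \alpha_{\mu}^{\ \rho} \alpha_{\nu}^{\ \sigma}\, \Pi_{\rho\sigma}$$

Similarly

$$T'_{\mu\nu} = \sum_{\rho,\sigma} \alpha_{\mu}^{\ \rho}\, \alpha_{\nu}^{\ \sigma}\, T_{\rho\sigma} \tag{XII.59}$$

where the coefficients $\alpha_{\mu}^{\ \rho}$ are determined by the rules (XII.53).

In the same way, we can arrive at the formula

$$T'^{\mu}{}_{\nu} = \sum_{\rho,\sigma} \alpha^{\mu}_{\rho}\, \alpha_{\nu}^{\ \sigma}\, T^{\rho}{}_{\sigma} \tag{XII.60}$$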

Examination of (XII.58) reveals that in general we must distin-
guish between the mixed components $T^{\mu}{}_{\nu}$ and $T_{\nu}{}^{\mu}$, i.e. see which
index, the first or the second, is a superscript and which is a subscript.
Indeed, for instance, in the general case $T^{1}{}_{0} = T^{10} \neq T^{01} = T_{0}{}^{1}$.
For the symmetric tensors $S^{\mu\nu}$ (for which $S^{\mu\nu} = S^{\nu\mu}$), the mixed
components $S^{\mu}{}_{\nu}$ and $S_{\nu}{}^{\mu}$ evidently coincide, so that the indices
may be placed one above the other. For antisymmetric tensors, the
relation $A^{\mu}{}_{\nu} = -A_{\nu}{}^{\mu}$ holds.

In an antisymmetric four-tensor, six components are independent
(four diagonal ones are zero, the others satisfy the condition
$A^{\mu\nu} = -A^{\nu\mu}$). Consequently, the array of the components of an anti-
symmetric four-tensor is as follows:

$$(A^{\mu\nu}) = \begin{pmatrix} 0 & A^{01} & A^{02} & A^{03} \\ -A^{01} & 0 & A^{12} & A^{13} \\ -A^{02} & -A^{12} & 0 & A^{23} \\ -A^{03} & -A^{13} & -A^{23} & 0 \end{pmatrix} \tag{XII.61}$$

Let us find formulas for transforming the components of the tensor
(XII.61). By (XII.55) for the component $A'^{01}$, we have

$$A'^{01} = \sum_{\rho,\sigma=0}^{3} \alpha^0_{\rho}\, \alpha^1_{\sigma}\, A^{\rho\sigma} = \sum_{\rho=0}^{3} \alpha^0_{\rho} \sum_{\sigma=0}^{3} \alpha^1_{\sigma} A^{\rho\sigma}$$

$$= \alpha^0_0 \sum_{\sigma=0}^{3} \alpha^1_{\sigma} A^{0\sigma} + \alpha^0_1 \sum_{\sigma=0}^{3} \alpha^1_{\sigma} A^{1\sigma} + \alpha^0_2 \sum_{\sigma=0}^{3} \alpha^1_{\sigma} A^{2\sigma} + \alpha^0_3 \sum_{\sigma=0}^{3} \alpha^1_{\sigma} A^{3\sigma}$$

The coefficients $\alpha^0_2$ and $\alpha^0_3$ are zeros [see (XII.21)], therefore the last
two sums will vanish. Of the four coefficients $\alpha^1_{\sigma}$, only two are non-
zero: $\alpha^1_0$ and $\alpha^1_1$. Consequently, in the first two sums, only the first
two addends are non-zero. Hence,

$$A'^{01} = \alpha^0_0\, (\alpha^1_0 A^{00} + \alpha^1_1 A^{01}) + \alpha^0_1\, (\alpha^1_0 A^{10} + \alpha^1_1 A^{11})$$

Taking into account that $A^{00} = A^{11} = 0$, $A^{10} = -A^{01}$, substituting
$\alpha_0$ for $\alpha^0_0$ and $\alpha^1_1$, and also $\alpha_1$ for $\alpha^0_1$ and $\alpha^1_0$, we arrive at the formula

$$A'^{01} = (\alpha_0^2 - \alpha_1^2)\, A^{01} = A^{01}$$

[see relation (XII.22)]. In a similar way, we can obtain transform-
ation formulas for the other components of the tensor $A^{\mu\nu}$.

The formulas for the transformation of all six independent com-
ponents are as follows:

$$A'^{01} = A^{01}, \quad A'^{02} = \alpha_0 A^{02} + \alpha_1 A^{12}, \quad A'^{03} = \alpha_0 A^{03} + \alpha_1 A^{13}$$

$$A'^{12} = \alpha_0 A^{12} + \alpha_1 A^{02}, \quad A'^{13} = \alpha_0 A^{13} + \alpha_1 A^{03}, \quad A'^{23} = A^{23} \tag{XII.62}$$

We must note that the formulas for the inverse transformation
differ from formulas (XII.62) only in the sign of the terms containing
the factor $\alpha_1$ [see (XII.23)].

We need formulas (XII.62) in electrodynamics.
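Formulas (XII.62) can be cross-checked against the general rule (XII.55) by brute force. The sketch below applies the double sum of (XII.55) to an antisymmetric array and compares the result with the six closed-form expressions. It assumes $\alpha_0 = \gamma$ and $\alpha_1 = -\beta\gamma$ for the coefficients of (XII.21); the function names are hypothetical.

```python
import math


def boost_coefficients(beta):
    """Return (alpha_0, alpha_1); the identification with gamma is assumed."""
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return gamma, -beta * gamma


def alpha_matrix(beta):
    a0, a1 = boost_coefficients(beta)
    return [[a0, a1, 0.0, 0.0],
            [a1, a0, 0.0, 0.0],
            [0.0, 0.0, 1.0, 0.0],
            [0.0, 0.0, 0.0, 1.0]]


def antisymmetric(a01, a02, a03, a12, a13, a23):
    """Array (XII.61) built from the six independent components."""
    return [[0.0, a01, a02, a03],
            [-a01, 0.0, a12, a13],
            [-a02, -a12, 0.0, a23],
            [-a03, -a13, -a23, 0.0]]


def transform_general(A, alpha):
    """Double sum of (XII.55) for every pair of indices."""
    return [[sum(alpha[m][r] * alpha[n][s] * A[r][s]
                 for r in range(4) for s in range(4))
             for n in range(4)] for m in range(4)]


def transform_xii_62(A, beta):
    """Closed-form expressions (XII.62), keyed by index pair."""
    a0, a1 = boost_coefficients(beta)
    return {(0, 1): A[0][1],
            (0, 2): a0 * A[0][2] + a1 * A[1][2],
            (0, 3): a0 * A[0][3] + a1 * A[1][3],
            (1, 2): a0 * A[1][2] + a1 * A[0][2],
            (1, 3): a0 * A[1][3] + a1 * A[0][3],
            (2, 3): A[2][3]}


if __name__ == "__main__":
    A = antisymmetric(1.0, 2.0, 3.0, 4.0, 5.0, 6.0)
    full = transform_general(A, alpha_matrix(0.6))
    for (m, n), value in transform_xii_62(A, 0.6).items():
        print((m, n), full[m][n], value)
```

The same `transform_general` routine also confirms that the transformed array stays antisymmetric, so six components remain sufficient.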

Let us form the following expression from the components of
the tensor $\Pi^{\mu}{}_{\nu} = a^{\mu} b_{\nu}$:

$$\sum_{\mu} \Pi^{\mu}{}_{\mu} = \sum_{\mu} a^{\mu} b_{\mu}$$

This expression is an invariant, i.e. a scalar. Similarly, for any ten-
sor the expression

$$\sum_{\mu} T^{\mu}{}_{\mu} = T^{0}{}_{0} + T^{1}{}_{1} + T^{2}{}_{2} + T^{3}{}_{3} \tag{XII.63}$$

is a scalar [compare with (X.21)]. It is called the trace of the tensor.
A glance at (XII.33) shows that $\sum_{\mu} \Pi^{\mu}{}_{\mu} = \sum_{\mu} \Pi_{\mu}{}^{\mu}$. Similarly

$$\sum_{\mu} T^{\mu}{}_{\mu} = \sum_{\mu} T_{\mu}{}^{\mu} \tag{XII.64}$$

The scalar product of the four-vector $a^{\nu}$ and the four-tensor $T^{\mu\nu}$
is defined to be the four-vector whose components are determined
by the formula

$$b^{\mu} = \sum_{\nu} T^{\mu}{}_{\nu}\, a^{\nu} \tag{XII.65}$$

Invariant Four-Tensors. A tensor which when multiplied by a
vector leaves the latter unchanged is naturally called a unit tensor.
The components of this tensor must obviously be taken equal to

$$\delta^{\mu}_{\nu} = \begin{cases} 1 & \text{if } \mu = \nu \\ 0 & \text{if } \mu \neq \nu \end{cases} \tag{XII.66}$$

Indeed, the introduction of these values into formula (XII.65)
yields

$$b^{\mu} = \sum_{\nu} \delta^{\mu}_{\nu}\, a^{\nu} = a^{\mu}$$

as matters should be for a unit tensor.

It follows from (XII.66) that $\delta^{\mu}{}_{\nu} = \delta_{\nu}{}^{\mu}$, i.e. that the tensor $\delta^{\mu}_{\nu}$
is symmetric. This is why we have arranged the indices one over
the other. The trace of the tensor $\delta^{\mu}_{\nu}$ is

$$\sum_{\mu=0}^{3} \delta^{\mu}_{\mu} = 4$$

The contravariant and covariant components of a unit tensor are
customarily designated by the symbols $g^{\mu\nu}$ and $g_{\mu\nu}$. It can be seen
that

$$(g^{\mu\nu}) = (g_{\mu\nu}) = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix} \tag{XII.67}$$

A comparison with (XII.4) shows that the tensor $g^{\mu\nu}$ (like $g_{\mu\nu}$)
is a metric tensor.

Let us form the expression

$$\sum_{\nu} g_{\mu\nu}\, a^{\nu}$$

Taking into account (XII.67), we find that when $\mu = 0$, this expres-
sion is $a^0 = a_0$, while when $\mu = i$ ($i = 1, 2, 3$), we obtain $-a^i = a_i$.
It thus follows that

$$\sum_{\nu} g_{\mu\nu}\, a^{\nu} = a_{\mu}$$

Similarly, we can see that

$$\sum_{\nu} g^{\mu\nu}\, a_{\nu} = a^{\mu}$$

Consequently, the scalar product of the vectors $a^{\mu}$ and $b^{\nu}$ can
be written in any of the following three ways:

$$\sum_{\mu} a^{\mu} b_{\mu} = \sum_{\mu,\nu} g_{\mu\nu}\, a^{\mu} b^{\nu} = \sum_{\mu,\nu} g^{\mu\nu}\, a_{\mu} b_{\nu} \tag{XII.68}$$

The tensors $\delta^{\mu}_{\nu}$, $g^{\mu\nu}$, and $g_{\mu\nu}$ are invariant: their components are
identical in all coordinate systems. The absolutely antisymmetric
unit four-pseudotensor of the fourth rank, $\varepsilon^{\mu\nu\rho\sigma}$, is also invariant.
The components of this tensor are determined similarly to those
of the pseudotensor $\varepsilon_{ikl}$ in Euclidean space [see (VI.15)]. If at least
two indices coincide, $\varepsilon^{\mu\nu\rho\sigma}$ is zero. Therefore, of its $4^4 = 256$ com-
ponents, only $4! = 24$ components are non-zero. It is assumed that

$$\varepsilon^{0123} = +1 \tag{XII.69}$$

The remaining 23 components are assumed to equal $+1$ or $-1$ de-
pending on what number of permutations of two indices, even or
odd, gives us the sequence $\mu, \nu, \rho, \sigma$ from the sequence
0, 1, 2, 3. It is obvious that 12 components have the value $+1$
and 12 the value $-1$.

According to the rule we have adopted above, lowering of all
the indices of a non-zero component of $\varepsilon^{\mu\nu\rho\sigma}$ must change the sign
of the component. Therefore, $\varepsilon_{0123} = -1$. Similarly, $\varepsilon_{\mu\nu\rho\sigma} = -\varepsilon^{\mu\nu\rho\sigma}$
(of the four indices, one must be 0, and the other three 1, 2, 3).
It thus follows that

$$\sum_{\mu,\nu,\rho,\sigma} \varepsilon^{\mu\nu\rho\sigma}\, \varepsilon_{\mu\nu\rho\sigma} = -24 \tag{XII.70}$$
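The counting argument above is easy to confirm by enumeration. The following sketch builds the components with upper indices from the parity of the index permutation, lowers the indices with the diagonal metric (1, -1, -1, -1), and evaluates the contraction (XII.70). The function names are illustrative, and parity is determined here by counting inversions, which is one of several equivalent methods.

```python
from itertools import product

G = (1, -1, -1, -1)  # diagonal elements of the metric tensor


def parity(indices):
    """+1 for an even permutation of (0, 1, 2, 3), -1 for an odd one."""
    inversions = sum(1 for i in range(4) for j in range(i + 1, 4)
                     if indices[i] > indices[j])
    return -1 if inversions % 2 else 1


def eps_upper(mu, nu, rho, sigma):
    """Component with four upper indices; eps_upper(0, 1, 2, 3) = +1."""
    indices = (mu, nu, rho, sigma)
    if len(set(indices)) < 4:
        return 0
    return parity(indices)


def eps_lower(mu, nu, rho, sigma):
    """Component with all four indices lowered."""
    return G[mu] * G[nu] * G[rho] * G[sigma] * eps_upper(mu, nu, rho, sigma)


def summary():
    """Return (non-zero count, count of +1 values, full contraction)."""
    all_indices = list(product(range(4), repeat=4))
    values = [eps_upper(*idx) for idx in all_indices]
    nonzero = sum(1 for v in values if v != 0)
    plus = sum(1 for v in values if v == 1)
    contraction = sum(eps_upper(*idx) * eps_lower(*idx) for idx in all_indices)
    return nonzero, plus, contraction


if __name__ == "__main__":
    print(summary())
```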

The Ostrogradsky-Gauss Theorem. In Euclidean space, the Ostro-
gradsky-Gauss theorem is written as follows [see (XII.13)]:

$$\int_{V} \sum_{i=1}^{3} \frac{\partial a_i}{\partial x_i}\, dV = \oint_{f} \sum_{i=1}^{3} a_i\, df_i \tag{XII.71}$$

(the integral of the divergence of the vector $\mathbf{a}$ over a certain volume $V$
equals the flux of this vector through the surface $f$ enclosing $V$).
The quantities $df_i$ are components of the vector $d\mathbf{f} = \mathbf{n}\, df$. They have
the values $df_x = dy\, dz$, $df_y = dz\, dx$, $df_z = dx\, dy$.

The following relation is a generalization of the Ostrogradsky-
Gauss theorem for pseudo-Euclidean four-space:

$$\int \sum_{\mu=0}^{3} \frac{\partial a^{\mu}}{\partial x^{\mu}}\, dV^* = \oint \sum_{\mu=0}^{3} a^{\mu}\, df_{\mu} \tag{XII.72}$$

where $a^{\mu}$ is a four-vector, $dV^* = dx^0\, dx^1\, dx^2\, dx^3 = c\, dt\, dV$ is an
element of the volume in four-space, and $df_{\mu}$ is a component of the four-
vector of an element of the hypersurface enclosing the four-volume
over which the integral on the left-hand side of formula (XII.72)
is taken. The components $df_{\mu}$ have the values $df_0 = dx^1\, dx^2\, dx^3 = dV$,
$df_1 = dx^0\, dx^2\, dx^3 = c\, dt\, dy\, dz$, etc.

Let us use the Ostrogradsky-Gauss theorem to prove a relation
that we need in Sec. 40. To make our proof more obvious, we shall
first give it for Euclidean three-space, and then perform similar
calculations for four-space.

Fig. XII.1.

Assume that we have the vector $a_i$ whose divergence is zero:

$$\sum_{i} \frac{\partial a_i}{\partial x_i} = 0$$

Let the vector $a_i$ be non-zero in a restricted region of space. By
(XII.71), we have

$$\int \sum_{i} \frac{\partial a_i}{\partial x_i}\, dV = \oint \sum_{i} a_i\, df_i = 0 \tag{XII.73}$$

Equation (XII.73) holds for an arbitrarily taken volume $V$ and
the surface $f$ enclosing it. Let us take as $V$ the volume confined be-
tween two infinite planes $x_1 = x_1^{(1)} = \text{const}$ and $x_1 = x_1^{(2)} = \text{const}$
(Fig. XII.1a). The integral over the side surface of this volume is
zero because, by assumption, at infinity $a_i = 0$. Consequently, the
right-hand side of formula (XII.73) can be written as

$$\int_{(1)} a_1\, df_1 - \int_{(2)} a_1\, df_1 = 0$$

(for the plane $x_1 = \text{const}$, the components $df_2$ and $df_3$ are zero). Hence

$$\int_{(1)} a_1\, df_1 = \int_{(2)} a_1\, df_1 = \text{const}$$

The result we have obtained signifies that the integral $\int a_1\, df_1$
taken over any infinite plane $x_1 = \text{const}$ has an identical value.
We must note that the coordinates $x_2$ and $x_3$ take on all values from
$-\infty$ to $+\infty$ in integration.

Now let us take as $V$ a volume enclosed by two surfaces of an arbi-
trary shape for all of whose points the coordinate $x_1$ is finite, while
the coordinates $x_2$ and $x_3$ vary from $-\infty$ to $+\infty$. Consequently,
the edges of the surfaces are at infinity (Fig. XII.1b). In this case,
the right-hand side of formula (XII.73) can be written as

$$\int_{(1)} \sum_{i} a_i\, df_i - \int_{(2)} \sum_{i} a_i\, df_i = 0$$

whence

$$\int \sum_{i} a_i\, df_i = \text{const} \tag{XII.74}$$

i.e. the integral has the same value for any surface including all the
two-dimensional space $x_2 x_3$ (the space $yz$).

Now let us assume that there is a tensor $T_{ik}$ satisfying the con-
dition

$$\sum_{k} \frac{\partial T_{ik}}{\partial x_k} = 0 \quad (i = 1, 2, 3) \tag{XII.75}$$

The components of $T_{ik}$ differ from zero in a restricted region of space.
Let us form the vector $\mathbf{a}$ having the components

$$a_k = \sum_{i} T_{ik}\, b_i \tag{XII.76}$$

where $b_i$ are the components of an arbitrary constant vector
($b_i = \text{const}$). The divergence of the vector $\mathbf{a}$ will be zero. Indeed,

$$\sum_{k} \frac{\partial a_k}{\partial x_k} = \sum_{k} \frac{\partial}{\partial x_k} \sum_{i} T_{ik}\, b_i = \sum_{i} b_i \sum_{k} \frac{\partial T_{ik}}{\partial x_k} = 0$$

Therefore, the vector (XII.76) satisfies the conditions in which
relation (XII.74) is observed. Substitution of the values (XII.76)
into (XII.74) yields

$$\int \sum_{k} a_k\, df_k = \int \sum_{k} \Big( \sum_{i} T_{ik}\, b_i \Big)\, df_k = \sum_{i} b_i \int \sum_{k} T_{ik}\, df_k = \text{const} \tag{XII.77}$$

(integration is performed over the surface including the entire two-
dimensional space $x_2 x_3$).

Introducing the symbol

$$p_i = \int \sum_{k} T_{ik}\, df_k \tag{XII.78}$$

we can write relation (XII.77) as

$$\sum_{i} b_i\, p_i = \text{const}$$

Because of the constancy and arbitrariness of the quantities $b_i$,
the last expression can be observed only provided that all the $p_i$'s
are also constant.

We have thus arrived at the following statement: if the tensor
$T_{ik}$ satisfies condition (XII.75) and its components are non-zero in
a restricted region of space, the values of the vector components $p_i$
do not depend on which of the surfaces spanning the entire two-
dimensional space $x_2 x_3$ the integration in formula (XII.78) is performed over.

Now let us reason similarly for four-space. Assume that there is
the vector $a^{\nu}$ whose four-divergence is zero:

$$\sum_{\nu=0}^{3} \frac{\partial a^{\nu}}{\partial x^{\nu}} = 0 \tag{XII.79}$$

($a^{\nu}$ are non-zero in a restricted region of four-space). By (XII.72),
we have

$$\int \sum_{\nu} \frac{\partial a^{\nu}}{\partial x^{\nu}}\, dV^* = \oint \sum_{\nu} a^{\nu}\, df_{\nu} = 0 \tag{XII.80}$$

This equation holds for any closed hypersurface; the integral on the
left-hand side is evaluated over the four-volume confined by this
hypersurface. Let us take as this volume the part of four-space con-
fined between two infinite hyperplanes $x^0 = x^0_{(1)} = \text{const}$ and
$x^0 = x^0_{(2)} = \text{const}$. The coordinates $x^1$, $x^2$, $x^3$ for points of such
hyperplanes vary from $-\infty$ to $+\infty$. Consequently, each of the hyper-
planes consists of the whole three-dimensional space taken at the
instant $t_1 = x^0_{(1)}/c$ for the first plane and at the instant $t_2 = x^0_{(2)}/c$
for the second one.

For the chosen four-volume, the right-hand side of relation
(XII.80) can be written as

$$\int_{(1)} a^0\, df_0 - \int_{(2)} a^0\, df_0 = 0$$

(since $dx^0 = 0$, all the $df_i$'s are zero). Hence

$$\int a^0\, df_0 = \text{const} \tag{XII.81}$$

This means that the value of the integral (XII.81) does not depend
on which of the hyperplanes $x^0 = \text{const}$ integration is performed
over.

Now let us take a four-volume enclosed by two hypersurfaces
of an arbitrary shape for all of whose points the coordinate $x^0$ is
finite, and the coordinates $x^1$, $x^2$, $x^3$ take on values from $-\infty$ to $+\infty$.
Such hypersurfaces include the entire three-dimensional space.
Writing the right-hand side of relation (XII.80) for such a volume,
we have

$$\int_{(1)} \sum_{\nu} a^{\nu}\, df_{\nu} - \int_{(2)} \sum_{\nu} a^{\nu}\, df_{\nu} = 0$$

Hence, the integral

$$\int \sum_{\nu} a^{\nu}\, df_{\nu} = \text{const} \tag{XII.82}$$

i.e. does not depend on which of the hypersurfaces enclosing the
entire three-dimensional space it is taken over. In other words, this
integral is time-independent, and its value is conserved.

Let us take as $a^{\nu}$ the four-vector with the components

$$a^{\nu} = \sum_{\mu} T^{\mu\nu}\, b_{\mu} \tag{XII.83}$$

where $b_{\mu}$ are the components of an arbitrary constant four-vector
($b_{\mu} = \text{const}$), and $T^{\mu\nu}$ is a four-tensor satisfying the condition

$$\sum_{\nu=0}^{3} \frac{\partial T^{\mu\nu}}{\partial x^{\nu}} = 0 \quad (\mu = 0, 1, 2, 3) \tag{XII.84}$$

The components of this tensor are assumed to be non-zero in a restrict-
ed region of four-space. It is not difficult to see that the vector
(XII.83) satisfies condition (XII.79). Consequently, relation (XII.82)
must be observed for it. Substitution of the values (XII.83) into
(XII.82) yields

$$\int \sum_{\nu} a^{\nu}\, df_{\nu} = \int \sum_{\nu} \Big( \sum_{\mu} T^{\mu\nu}\, b_{\mu} \Big)\, df_{\nu} = \sum_{\mu} b_{\mu} \int \sum_{\nu} T^{\mu\nu}\, df_{\nu} = \text{const}$$

(integration is performed over an arbitrary hypersurface including
the entire three-dimensional space).

If we introduce the symbol

$$p^{\mu} = \int \sum_{\nu} T^{\mu\nu}\, df_{\nu} \tag{XII.85}$$

the relation we have obtained can be written as

$$\sum_{\mu} b_{\mu}\, p^{\mu} = \text{const}$$

Owing to the arbitrariness and constancy of the quantities $b_{\mu}$, the
latter condition can be observed only provided that all the $p^{\mu}$'s are
constant (i.e. time-independent). We thus arrive at the following
conclusion. If there is a tensor $T^{\mu\nu}$ whose components are non-zero
only in a restricted region of four-space and satisfy condition (XII.84),
the components of the four-vector (XII.85) are conserved, i.e. do not
change their value with time. It is obvious that the four-vector with
the components

$$P^{\mu} = a \int \sum_{\nu} T^{\mu\nu}\, df_{\nu} \tag{XII.86}$$

where $a$ is an arbitrary constant, will also be conserved.

XIII. The Dirac Delta Function 

The Dirac delta function ($\delta$ function) is defined as the function
determined as follows: $\delta(x) = 0$ at all $x$'s differing from zero; at
$x = 0$, the function $\delta(x)$ becomes infinite, and so that

$$\int_{-\infty}^{+\infty} \delta(x)\, dx = 1 \tag{XIII.1}$$

The delta function is useful owing to its following property:

$$\int_{-\infty}^{+\infty} f(x)\, \delta(x)\, dx = f(0) \tag{XIII.2}$$

where $f(x)$ is an arbitrary continuous function of $x$. This property
follows from the definition of the delta function. Indeed, since
$\delta(x) = 0$ at all $x \neq 0$, only the vicinity of the point $x = 0$ makes
a non-zero contribution to the integral (XIII.2). In this vicinity,
$f(x)$ can be assumed equal to $f(0)$. Putting $f(0)$ outside the integral
sign and taking (XIII.1) into account, we arrive at (XIII.2).

It is evident that the function $\delta(x - a)$ has the same properties
in the vicinity of the point $x = a$ as the function $\delta(x)$ in the vicinity
of the point $x = 0$. Particularly,

$$\int_{-\infty}^{+\infty} f(x)\, \delta(x - a)\, dx = f(a) \tag{XIII.3}$$

The integration region in formulas (XIII.2) and (XIII.3) need
not necessarily extend from $-\infty$ to $+\infty$; it is sufficient for this
region to include the singular point at which the delta function is
non-zero.
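Property (XIII.3) can be made tangible by replacing the delta function with a narrow normalized peak and integrating numerically. The sketch below uses a Gaussian of width `eps` as the stand-in; this particular approximation, the midpoint rule, and the parameter values are assumptions of the example, not part of the definition above. The integration interval covers only a small neighbourhood of the point $x = a$, in line with the remark on the integration region.

```python
import math


def delta_approx(x, eps):
    """Normalized Gaussian of width eps, a stand-in for delta(x) as eps -> 0."""
    return math.exp(-x * x / (2.0 * eps * eps)) / (eps * math.sqrt(2.0 * math.pi))


def sift(f, a, eps=1e-3, half_width=10.0, n_steps=4000):
    """Midpoint-rule estimate of the integral of f(x) * delta(x - a)
    over the interval [a - half_width * eps, a + half_width * eps]."""
    lower_limit = a - half_width * eps
    dx = 2.0 * half_width * eps / n_steps
    total = 0.0
    for k in range(n_steps):
        x = lower_limit + (k + 0.5) * dx
        total += f(x) * delta_approx(x - a, eps) * dx
    return total


if __name__ == "__main__":
    print(sift(lambda x: 1.0, 0.0))        # close to 1, see (XIII.1)
    print(sift(math.cos, 0.0))             # close to cos(0), see (XIII.2)
    print(sift(math.exp, 1.0), math.e)     # close to exp(1), see (XIII.3)
```

The agreement improves as `eps` is reduced, provided the step of the midpoint rule stays small compared with `eps`.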

By analogy with $\delta(x)$, a three-dimensional delta function
designated by $\delta(\mathbf{r})$ is defined. It is zero everywhere except for the
origin of coordinates. At the origin of coordinates, $\delta(\mathbf{r})$ becomes
infinite so that

$$\int \delta(\mathbf{r})\, dV = 1 \tag{XIII.4}$$

The integral is evaluated over the entire three-dimensional space.

The three-dimensional delta function can be written as the
product of three one-dimensional delta functions:

$$\delta(\mathbf{r}) = \delta(x)\, \delta(y)\, \delta(z) \tag{XIII.5}$$

It follows from the definition of the delta function that

$$\int f(\mathbf{r})\, \delta(\mathbf{r} - \mathbf{r}_0)\, dV = f(\mathbf{r}_0) \tag{XIII.6}$$

The integration region in formula (XIII.6) need not necessarily
include the entire three-dimensional space; it is sufficient that this
region include the point determined by the vector $\mathbf{r}_0$.


XIV. The Fourier Series and Integral 

A function satisfying the condition

$$f(t + T) = f(t) \tag{XIV.1}$$

where $T$ is a constant, is said to be periodic*. The quantity $T$ is
called the period of the function. A very simple example of a periodic
function is the harmonic function $f(t) = a \cos(\omega t + \alpha)$, where
$\omega = 2\pi/T$ is the cyclic (angular) frequency of the function.

The overwhelming majority of periodic functions encountered
in physical problems can be written as the series

$$f(t) = \sum_{n=0}^{\infty} (a_n \cos n\omega_0 t + b_n \sin n\omega_0 t) \tag{XIV.2}$$

For brevity, we have used the notation

$$\omega_0 = \frac{2\pi}{T} \tag{XIV.3}$$

where $T$ is the period of the function.

The series (XIV.2) is called the Fourier series. The constants $a_n$
and $b_n$ are called Fourier coefficients. We shall not discuss the con-
ditions which a function must satisfy for its values to coincide with
those of the series (XIV.2), referring readers interested in this matter
to texts on calculus.

A non-periodic function can also be represented as a Fourier
series. But such a representation will be suitable for the non-periodic
function only on the segment from $-T/2$ to $+T/2$.

* Having in mind the applications, we have designated the independent
variable by $t$ instead of by $x$.

Expression (XIV.2) is an expansion of the function $f(t)$ into a series
in the functions

$$1,\ \cos\omega_0 t,\ \sin\omega_0 t,\ \ldots,\ \cos n\omega_0 t,\ \sin n\omega_0 t,\ \ldots \tag{XIV.4}$$

The system of functions (XIV.4) is orthogonal over the segment
$-T/2$, $+T/2$. This signifies that an integral of the product of two
different functions of the system over this segment is zero, while
a similar integral of the square of any function is non-zero. The
property of orthogonality can be written briefly as

$$\int_{-T/2}^{+T/2} \psi_n \psi_m\, dt = q_n\, \delta_{nm}$$

Here $\psi_n$ and $\psi_m$ are any functions belonging to the system (XIV.4).
It is a simple matter to see that $q_0 = T$ and $q_n = T/2$ (for $n \neq 0$).
To this end, it is sufficient to recall that the average values of the
square of a cosine and the square of a sine are 1/2.

The orthogonality of the system of functions (XIV.4) allows us
to find the values of the Fourier coefficients. Let us multiply relation
(XIV.2) by the function $\cos m\omega_0 t$ (here $m \neq 0$) and then integrate
over the segment $-T/2$, $+T/2$. Owing to orthogonality, of all the
integrals on the right-hand side, only one will be non-zero, namely,

$$\int_{-T/2}^{+T/2} a_m \cos^2 m\omega_0 t\, dt = a_m \frac{T}{2}$$

We therefore arrive at the formula

$$\int_{-T/2}^{+T/2} f(t) \cos m\omega_0 t\, dt = a_m \frac{T}{2}$$

whence

$$a_m = \frac{2}{T} \int_{-T/2}^{+T/2} f(t) \cos m\omega_0 t\, dt \quad (m \neq 0) \tag{XIV.5}$$

Similar calculations lead to expressions for $b_m$ and $a_0$:

$$b_m = \frac{2}{T} \int_{-T/2}^{+T/2} f(t) \sin m\omega_0 t\, dt \quad (m \neq 0) \tag{XIV.6}$$

$$a_0 = \frac{1}{T} \int_{-T/2}^{+T/2} f(t)\, dt, \quad b_0 = 0 \tag{XIV.7}$$
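Formulas (XIV.5) to (XIV.7) can be tried out on a function whose coefficients are known in advance. The sketch below evaluates the integrals with a midpoint rule on a uniform grid over one period; the test function, the period `T = 2.0`, the number of steps and the function names are all choices made for this example.

```python
import math


def fourier_coefficients(f, T, m, n_steps=2000):
    """Return (a_m, b_m) from (XIV.5)-(XIV.7) using a midpoint rule."""
    w0 = 2.0 * math.pi / T
    dt = T / n_steps
    ts = [-T / 2.0 + (k + 0.5) * dt for k in range(n_steps)]
    if m == 0:
        # (XIV.7): a_0 is the mean value of f over the period, b_0 = 0.
        return sum(f(t) for t in ts) * dt / T, 0.0
    a_m = 2.0 / T * sum(f(t) * math.cos(m * w0 * t) for t in ts) * dt
    b_m = 2.0 / T * sum(f(t) * math.sin(m * w0 * t) for t in ts) * dt
    return a_m, b_m


def sample(t, T=2.0):
    """Test function with a_0 = 1, a_1 = 2, b_2 = 3 and nothing else."""
    w0 = 2.0 * math.pi / T
    return 1.0 + 2.0 * math.cos(w0 * t) + 3.0 * math.sin(2.0 * w0 * t)


if __name__ == "__main__":
    for m in range(4):
        print(m, fourier_coefficients(sample, 2.0, m))
```

For a smooth periodic integrand sampled over a whole period, the uniform-grid rule is very accurate, so the recovered coefficients agree with the built-in ones to rounding error.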

The Fourier series can be written in the complex form. For this
purpose, we replace the cosine and sine with exponentials:

$$\cos n\omega_0 t = \frac{e^{in\omega_0 t} + e^{-in\omega_0 t}}{2}, \quad \sin n\omega_0 t = \frac{e^{in\omega_0 t} - e^{-in\omega_0 t}}{2i}$$

The expression in parentheses in formula (XIV.2) now becomes

$$a_n \frac{e^{in\omega_0 t} + e^{-in\omega_0 t}}{2} + b_n \frac{e^{in\omega_0 t} - e^{-in\omega_0 t}}{2i} = \frac{a_n + ib_n}{2}\, e^{-in\omega_0 t} + \frac{a_n - ib_n}{2}\, e^{in\omega_0 t} = C_n e^{-in\omega_0 t} + C_{-n} e^{in\omega_0 t} \tag{XIV.8}$$

We have introduced the following coefficients instead of $a_0$, $a_n$,
and $b_n$:

$$C_0 = a_0, \quad C_n = \frac{a_n + ib_n}{2}, \quad C_{-n} = \frac{a_n - ib_n}{2} \quad (n \neq 0) \tag{XIV.9}$$

We must note that

$$C_{-n} = C_n^* \tag{XIV.10}$$

(the asterisk stands for a complex conjugate quantity).

Making the substitution (XIV.8) in (XIV.2), we obtain

$$f(t) = \sum_{n=-\infty}^{+\infty} C_n e^{-in\omega_0 t} \tag{XIV.11}$$

Expression (XIV.11) is an expansion of the function $f(t)$ into
a series in the functions

$$1,\ e^{\pm i\omega_0 t},\ e^{\pm i2\omega_0 t},\ \ldots \tag{XIV.12}$$

The system of functions (XIV.12) is also orthogonal over the seg-
ment $-T/2$, $+T/2$. For complex functions, this signifies that
$\int_{-T/2}^{+T/2} \psi_n^* \psi_m\, dt = q_n \delta_{nm}$ (here $q_n \neq 0$). Indeed, it is not difficult to
see that

$$\int_{-T/2}^{+T/2} e^{i(m-n)\omega_0 t}\, dt = \delta_{nm} T \tag{XIV.13}$$
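Relation (XIV.13) can be verified numerically for a few pairs of indices. The sketch below evaluates the integral with a midpoint rule over one period; the period `T = 2.0`, the number of steps and the function name `overlap` are assumptions of the example.

```python
import cmath
import math


def overlap(m, n, T=2.0, n_steps=2000):
    """Midpoint-rule estimate of the integral in (XIV.13) over [-T/2, +T/2]."""
    w0 = 2.0 * math.pi / T
    dt = T / n_steps
    total = 0.0 + 0.0j
    for k in range(n_steps):
        t = -T / 2.0 + (k + 0.5) * dt
        total += cmath.exp(1j * (m - n) * w0 * t) * dt
    return total


if __name__ == "__main__":
    print(abs(overlap(3, 3)))    # close to T for equal indices
    print(abs(overlap(3, 1)))    # close to zero for different indices
    print(abs(overlap(-2, 2)))   # close to zero as well
```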


The orthogonality of the system of functions (XIV.12) allows us
to find the values of the coefficients $C_n$. Let us multiply relation
(XIV.11) by $e^{im\omega_0 t}$ and then integrate over the segment $-T/2$,
$+T/2$. Owing to orthogonality, of all the integrals on the right-hand
side, only one will be non-zero, namely the one with $n = m$:

$$\int_{-T/2}^{+T/2} C_m e^{-i(n-m)\omega_0 t}\, dt = C_m T$$

We therefore arrive at the formula

$$\int_{-T/2}^{+T/2} f(t)\, e^{im\omega_0 t}\, dt = C_m T,$$

If two functions $\varphi$ and $f$ are related by an expression of the form

$$\varphi(x) = \int_{a}^{b} f(t)\, K(x, t)\, dt \tag{XIV.23}$$

the function $\varphi(x)$ is said to be the integral transform of the function
$f(t)$, and the function $K(x, t)$, the kernel of the integral transform.

A comparison of formulas (XIV.21) and (XIV.23) shows that the
function $\varphi(\omega)$ determined by expression (XIV.21) is the integral
transform of the function $f(\xi)$, the kernel of the transform being
$K(\omega, \xi) = (1/\sqrt{2\pi})\, e^{i\omega\xi}$. This is why the function $\varphi(\omega)$ is called
the Fourier transform of the function $f(\xi)$. The function $f(t)$ de-
termined by expression (XIV.22) is called the inverse Fourier trans-
form. We must note that the direct [formula (XIV.21)] and the
inverse [formula (XIV.22)] Fourier transforms differ only in the
sign of the exponent.

The function $f(\xi)$ in expression (XIV.21) is also called the Fourier
image of the function $\varphi(\omega)$.

Acceleration, four-dimensional, 135, 136
Action, 121ff 

charged particle in electromagnetic field, 
232ff 

for closed system, 148 
contracted, 123 
dimension, 34 

for electromagnetic field, 234 ff 
field and particle system, 236 
generalized expression, 147 
quantum, 34 

for relativistic particle, 143ff 
variation, 238 
Angle(s), 

Euler, 85ff 
nutation, 87 
precession, 87 
proper rotation, 87 
Apex, 86, 113 
Axes, 

coordinate, inversion, 327f 
free, of rotation, 99ff 
principal, of inertia, 91, 92 


Body, rigid, see Rigid body 
Bracket, Poisson, 119f 


Charges, 

bound, 166ff 
free, 16C 
Coefficient(s),

coupling, 80
Lamé, 388f

Cofactor, algebraic, 340, 341
Collisions, elastic, 49ff
Condition, Lorentz gauge, 205 
in four-dimensional form, 219 
Conics, 309 ff 
Constant, dielectric, 171 
Constraint(s), 14f
equations, 14f
geometric, 14
holonomic, 14f
ideal, 15
integrable, 14
kinematic, 14
non-holonomic, 15
non-integrable, 15
non-stationary, 15, 297ff 
rheonomous, 15 
scleronomous, 15 
stationary, 15 
Continuum, 11 
Coordinates, 

Cartesian, and generalized coordinates, 20 
curvilinear, 387ff 
orthogonal, 388ff 
cyclic, 118 


cylindrical, 389
generalized, 11f, 19, 82
cyclic, 41
normal, 76
principal, 76

transformation formulas, 394ff
Curl, 375ff

position vector, 383 
Curve, 

eccentricity, 310f 
parameter, 311 


Damping, aperiodic, 68f 
Density, 
charge, 159 

bound, 169, 172 
displacement current, 201 
electric current, 209 
energy, electromagnetic field, 209f 
momentum, 154 

electromagnetic field, 213f 
momentum flux, 155 

electromagnetic field, 211 
power, electromagnetic field forces, 208f 
total current, 201 
Determinant(s), 338ff 
minor, 340 
multiplication, 342 
order, 339 
transposed, 340 
Dipole, 

field potential, 163, 165 
field strength, 163 
Dirac, P.A.M., 177 
Directions, principal, 356 
Discriminant, quadratic form, 348 
Disorder, in transposition, 320 
Displacement, electric, 170f, 174 
true, 298 
virtual, 298 
Divergence, 372ff 

in curvilinear coordinates, 390f 
in n-dimensional space, 374 
position vector, 383 


Effect, Doppler, 
longitudinal, 257 
transverse, 257 
Einstein, A., 125, 127, 141 
Electric field, 289f 
arbitrarily moving charge, 280ff
dipole radiation, 293
uniformly moving charge, 275, 281
Electromagnetic field,
energy, 208
energy density, 209f
energy flux density, 209f
four-potential, 219, 221
momentum density, 213f
momentum flux, 211
momentum flux density, 211
potential(s), 203ff
gauge transformations, 204f 
scalar, 203ff 
vector, 203ff 
Ellipse, 309 

canonical equation, 309 
eccentricity, 309 
equation, 312 
Ellipsoid, 

angular momentum, 110f
energy, 110f
inertia, 91
tensor, 366, 368 
Energy, 

centrifugal, 62 
closed system, 37 
dissipation, 66 
free symmetric top, 109f 
kinetic, 
particle, 141 
rigid body, 89, 94 
rotation, 89, 90f 

particle, in uniformly rotating coordinate 
system, 62 
pendulum, 30 
potential system, 16, 64 
relativistic expression, 139, 144 
rest, particle, 141 
total, 

particle, 140 
system, 25 

translational motion, 89, 94 
Equation(s), see also Formula(s) 
canonical, 115ff 
ellipse, 309 
hyperbola, 309f 
parabola, 310 
characteristic, 74 
matrix, 351 
constraints, 14f 
continuity, 200 

from charge conservation, 218 
in differential form, 188 
in integral form, 188 
d’Alembert’s, 207, 248 
ellipse, 312 

Euler’s, 105ff, 307, 308 
spherical top, 107 
Hamilton-Jacobi, 123 
classical approximation, 247 
for contracted action, 124 
relativistic, 146 
Hamilton’s, 115ff 
hyperbola, 312f
invariance, 125 
Lagrange’s, 12f, 17ff, 23
corresponding to cyclic coordinates, 41 
generalization, 149f 
in generalized coordinates, 19ff, 23 
for holonomic system, 297ff 
from least action principle, 34 
Laplace’s, 160 
linear, 

differential, with constant coefficients, 
313ff

homogeneous systems, 345ff 
non-homogeneous systems, 342ff 
Maxwell’s, 127, 201ff, 228ff, 238 

derivation from least-action principle, 
237f
motion, 

centre of mass, 102 



particle in field, 230f 
rigid body, 102ff 
Newton’s, 127 

second law, 12, 13 
parabola, 312 
Poisson’s, 160, 207 

field in homogeneous dielectric, 173 
field in magnetic, 107
for vector potential, 179ff
wave, 249 

four-dimensional, 249 
generalized, 260 
Euler, L., 105 

Expansion, potential, in multipoles, 162 

Extremals, 307 

Field, 

electric, see Electric field 
electromagnetic, see Electromagnetic field 
electrostatic, in vacuum, 157ff 
infinite solenoid, 182ff 
invariants, 225ff 
line current, 181 
magnetic, see Magnetic field 
in magnetics, 194ff

point charge in homogeneous dielectric, 172

inside solenoid, 184f
outside solenoid, 185f 
strength, 157ff 
in dielectric, 167 
point charge, 158 
Flux, 

magnetic induction, 177 
momentum, 211 
electromagnetic wave, 254f 
Force(s), 

centrifugal, of inertia, 61 
conservative, 16 
Coriolis, 61 
dissipative, 18 
driving, 70 

complex amplitude, 70 
electromotive, induced, 199 
generalized, 22, 24 
pendulum, 30 
generalized-potential, 18 
inertia, 61 
Lorentz, 177 
Minkowski, 137f 
time component, 137f 
non-potential, 66 
potential, 16 
stationary, 16 

Forms, quadratic, see Quadratic form(s)
Formula(s), see also Equation(s) 
field transformation, 222ff 
Rutherford’s, 57 
Four-gradient, 400ff 
Four-momentum, 152 
closed system, 152 
covariant component, 146 
Four-operator, del, 400 
contravariant components, 401 
Four-potential, electromagnetic field, 219, 221, 277

Four-tensors, 403ff
antisymmetric, 404f 
invariant, 406f 
Four-vector, 134f, 398f
charge-current, 217 
components, 398f 
contravariant, 399, 401f 
covariant, 399, 402 

covariant components, transformation, 402
current, 217f
momentum-energy, 140 
scalar product, 399, 400 
spatial component, 398 
square, 398, 400 
time component, 398 
Four-velocity, 135f
Frame, reference, see Reference frame 
Frequencies, natural, 74 
Function, 

Dirac delta, 412f 
Hamiltonian, 116 
Lagrangian, 11, see also Lagrangian 
non-periodic, 413f, 416
periodic, 413, 416 
Rayleigh’s dissipative, 18f 
Functional, 300ff 
continuous, 302 
extremum, 304, 306ff 
linear, 301 
maximum, 305 
minimum, 305 
variation, 302f 


Gauge, 

Coulomb, 207 
Lorentz, 206 
transverse, 207 
Gauging, potentials, 204 
Gradient, 370ff 

in curvilinear coordinates, 389f 
four-dimensional, 400ff 
in n-dimensional space, 371 


Hamilton, W. R., 33, 115, 371 
Hamiltonian, 116, 118 
relativistic expression, 143 
Homogeneity, 36 
Hyperbola, 309 
canonical equation, 309f 
eccentricity, 310 
equation, 312f 


Index(ices), 

dummy, 16, 20, 371, 393 
free, 393 

refractive, complex, 264f 
Induction, 
electric, 170 

magnetic, 177, 194, 196f 
Integral(s), 

Fourier, 417 

motion, 25f, 37, 113, 119, 120 
Interval, 127ff, 144 
space-like, 129 
time-like, 129 
Invariance, gauge, 205 
Invariants, field, 225ff 
Isotropy, 36 


Kernel, integral transform, 418 
Kovalevskaya, S. V., 111


Lagrange, J. L., 111
Lagrangian, 11f, 17, 18, 23f
charged particle in field, 245 
density, 148 
for field, 239 
and energy, 24ff 
for field, 236 

for free relativistic particle, 143f



in generalized coordinates, 27f 
for non-inertial reference frames, 57ff
pendulum, 30, 31, 32 
in polar coordinates, 41 
rigid body, 101f

for uniformly rotating coordinate system, 
61 

Law, 

Biot-Savart, 187 
conservation, 

angular momentum, 39f 
charge, 218 
energy, 36f
momentum, 37ff
electromagnetic induction, 199f 
Length, Lorentz contraction, 133 
Line(s),

coordinate, 387 
nodal, 85 
world, 127 


Magnetic field, 289f 
arbitrarily moving charge, 283 
dipole radiation, 293 
quadrupole radiation, 294 
uniformly moving charge, 275f 
Magnetization, 194 
diamagnetics, 196 
paramagnetics, 196 
Mass, 

reduced, system, 47 
relativistic, 142 
rest, 142 
Matrix(ces), 330ff 
algebra, 334ff 
asymmetric, 333 
characteristic equation, 351
column, 333 
commutative, 335 
degenerate, 331 
diagonal, 333f 
difference, 334 
elements, 330 
identity, 334 
inverse, 331 
as linear operator, 332 
multiplication, 334ff
orthogonal, 331, 338 
quadratic form, 348, 350 
rank, 346 
row, 333 
singular, 331 
skew-symmetric, 333 
square, 332, 346 
sum, 334 
symmetric, 333 
trace, 334 
transformation, 330 
transposed, 331, 340 
unit, 334, 337 
Maxwell, J. C., 200, 201 
Mechanics, relativistic, 127 
Minor, determinant, 340 
Moment(s), 
dipole, 162 
inertia, 
axial, 91 
centrifugal, 91 
principal, 91 
magnetic, 192f, 295 
plane loop with current, 193 
system of discrete charges, 192 
quadrupole, 296 
Momentum, 
angular, 62 
four-dimensional, 139f 
generalized, 24 
pendulum, 30 

relativistic particle, 141f, 144 
system, density, 154 
three-dimensional, relativistic formula, 
139 

for uniformly rotating coordinate system, 
61 

Monopoles, 

Dirac’s, 177 
field potential, 165 
Motion, 
aperiodic, 68f 
finite, 45 
infinite, 45 
Multipole, 162 
field potential, 165f 
first-order, 162 


Number, 

complex wave, 261, 263 
degrees of freedom, 15 
Nutation, 87, 113f


Octupole, 166 
Operator, 

d’Alembertian, 207, 217 
del, 371 

application, 379ff 
integral determination, 386f 
Hamiltonian, 371 
Laplacian, 381f
in curvilinear coordinates, 392 
Oscillations, 
damped, 66ff 
forced, 70ff 
free, 64ff 
harmonic, 65f 
complex amplitude, 66 
normal, 76 
small-amplitude, 64ff 


Parabola, 310 
canonical equation, 310 
eccentricity, 310 
equation, 312 
Parameter, 
focal, 311 
impact, 54, 56 
Particle(s),

capture, 49 

charged, in electromagnetic field, 
action, 232ff 
motion, 230 

deflection angle, 52, 54 
and impact parameter, 54
divergence angle, 52f 
elastic collisions, 49ff 
energy, 245 
in central field, 42 
in field, Hamiltonian, 246f 
generalized momentum, 245 
head-on collision, 53 
motion in central force field, 41ff
moving along straight line, 32f 
generalized force, 33 
generalized momentum, 33 
Lagrangian, 33 

in non-inertial reference frame, 57ff 
recoil angle, 52 
relativistic, 143ff 
rest energy, 141 



scattering, 49, 53ff 
scattering angle, 52 
total energy, 140 
trajectory, 42ff 

equation in polar coordinates, 43ff 
velocities after collision, 51 
Pendulum(s), 

with constantly accelerating suspension 
point, 31f 
coupled, 77ff 
kinetic energy, 77 
natural frequency, 78 
normal coordinates, 79 
potential energy, 77 
simple, 30 

with uniformly moving suspension point, 
30f 

Permeability, 196 
Permittivity, 171 
complex, 263 
relative, 171 
Phase, wave, 256 
Point, world, 127 
Polarization, dielectric, 167f, 356f
Postulates, Einstein’s, 125 
Potential(s), 

calculated in dipole approximation, 288 
field, 

dipole, 163, 165 
electrostatic, 158f 
monopole, 165 
multipole, 165f 
point charge, 158 
quadrupole, 165 
generalized, 18 
Liénard-Wiechert, 278
magnetic, 194 
retarded, 271f

field of charge system, 284 
system, 16 
vector, 194ff, 291ff

magnetic field, 178, 189 
Power, dipole radiation, 291, 295 
Precession, 

pseudoregular, 114 
regular, 87, 108 
Principle, 

constancy of light speed, 125f 
Einstein’s, relativity, 125, 127 
Galileo’s, mechanical, relativity, 125f 
Hamilton, 33ff 

least action, 13, 33ff, 117, 233 
variational, mechanics, 34 
Problem, Lagrange’s, 111f
Process, aperiodic, 68 
Pseudoscalars, 325, 328 
Pseudotensor, 364
contraction, 365 
Pseudovector, 324f, 328, 369 


Quadratic form(s), 347ff
canonical, 348 
diagonal, 348 
discriminant, 348 
matrix, 348, 350 
negative definite, 347, 348 
positive definite, 347, 348 
simultaneous reduction to diagonal form, 
352f 

Quadrupole, 163f 
moment, 164f 


Radiation, 
dipole, 290f 

magnetic dipole, 293, 296 
quadrupole, 294f
Reaction, constraint, 15 
Reference frame, 
c-, 50 

centre-of-mass, 50 
l-, 50

laboratory, 50 

Retardation, proper, 284, 286 
Rigid body, 

angular momentum, 95ff 
proper, 96 

degrees of freedom, 82 
elementary displacement, 82 
equations of motion, 102ff 
kinematics, 82ff 
Rule, Cramer’s, 343f 


Scalars, 359 
true, 325 

Scattering, differential effective cross section, 54f, 57
Schrödinger, E., 124
Series, Fourier, 413ff
in complex form, 414f 
Space, 

configuration, 19, 34 
pseudo-Euclidean, 394
q-, 19

Strength, magnetic field, 196, 197 
Surface(s), 
constant φ, 370
coordinate, 387 
Susceptibility, 
electric, 171 
magnetic, 196 
Symbol, Kronecker, 319 
skew-symmetric, 319 
System(s), 
closed, 

four-momentum, 152 
energy, 37 
conservative, 16, 36 
coordinate, see Reference frame 
energy-momentum tensor, 152ff 
holonomic, 14, 20 
with ideal constraints, 15f
mechanical, 11 

number of degrees of freedom, 15 

potential, 16 

potential energy, 16 

reduced mass, 47 

retarded time, 284 

total energy, 25 

two interacting particles, 45ff


Tensor(s), 355ff 

antisymmetric, 362f
of second rank, 369f 
components, 358 
diagonal, 358 
contraction, 360 
dielectric susceptibility, 358 
electrical susceptibility, 175 
electromagnetic field, 220f 
ellipsoid, 366, 368 
energy-momentum, 152ff

for electromagnetic field, 240ff 
four-dimensional, 403ff 
inertia, 91ff
inner product, 361 
invariant, 360 
as linear operator, 362 
magnetic susceptibility, 198 



Maxwell stress, 215, 254 
metric, 393, 406 
multiplication, 360 
in n-dimensional space, 359 
permittivity, 175 
principal values, 359, 367f 
rank, 359 
scalar product, 361 
second rank, 358, 365ff, 369ff
stress, 155 
sum, 360 
symmetric, 362f 
as operator, 368 
of second rank, 365ff 
trace, 227, 360, 405, 406 
true, 363f
unit, 359, 406 
Theorem, 

Euler’s, for homogeneous functions, 300
Gauss’s, 159, 177 

Gauss’s divergence, see Theorem, Ostrogradsky-Gauss

Ostrogradsky-Gauss, 375, 407ff 
for pseudo-Euclidean four-space, 407f 
parallel axis, 94 
Steiner, 94 
Stokes’s, 157, 379 
Theory, relativity, special, 125 
Time, 

proper, body, 129 
retarded, system, 284 
Top, 

asymmetrical, 93 
rapid, 114 
spherical, 92 

Euler’s equations, 107 
symmetrical, 93, 114 
free, 107ff 

in homogeneous gravitational field, 111f
unbalanced, 111ff
Trace, 

matrices, 334 

tensor, 227, 360, 405, 406
Trajectory, motion, 121 
Transform, Fourier, 418 
inverse, 418

Transformations, Lorentz, 132f 


Variation, function, 301 
Vector(s), 316ff, 359
acceleration, 135 
circulation, 375 
components, 323 

transformations, 325f 
cross product, 317 
dot product, 316 
flux, 372 

four-, see Four-vector

four-dimensional, 398, see also Four-vector 
four-position, 130ff 
free, 318

increment in rotation, 328ff
n-, 327 

in n-dimensional space, 344 
Poynting’s, 210, 253f, 291
scalar product, 316, 322, 327 
for n-dimensional space, 327 
scalar triple product, 317, 325, 328
square of vector product, 318 
true, 324f 

vector product, 317, 323 
vector triple product, 317 
velocity, 135 
wave, 255 

four-dimensional, 256 
Velocity(ies),
centre of mass, 50
four-dimensional, 135f
generalized, 12, 19
Volume, contraction, 133


Wave(s), 

attenuation factor, 263 
average intensity, 266f
electromagnetic, 249 
magnetic field, 265 
monochromatic, 255 



non-monochromatic, 265ff 
plane, 250 

in conducting medium, 260ff
polarized, 

circularly, 259 
elliptically, 259
linearly, 260 

spectral decomposition, 265
Work, dissipative forces, 1U


Zone, wave, 288 
Fundamentals of Theoretical Physics
(in two volumes) is a logical
continuation of the author's three-volume
general course of physics.
Everything possible has been done
to avoid repeating what is contained
in the three-volume course.

The first volume is devoted
to mechanics and electrodynamics,
and the second volume to quantum
mechanics.

An appreciable difficulty appearing
in studying theoretical physics
is that quite often many mathematical
topics have either never been studied
by the reader or have been forgotten
by him. To eliminate this difficulty,
both volumes are furnished with
detailed mathematical appendices.

The book has been conceived as a
training aid for students of
non-theoretical specialities of higher
educational institutions.
It will also be helpful for physics
instructors at higher schools,
and for everyone interested in the
subject but having no time to become
acquainted with it using